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LOG(F):  An  Optimal  Combination  of  Logic  Programming, 

Rewriting,  and  Lazy  Evaluation 

Sanjai  Narain 
Rand  Corporation 
1700  Main  Street 
Santa  Monica,  CA  90406 

ABSTRACT 

A  new  approach  for  combining  logic  programming,  rewriting,  and  lazy  evaluation  is 
described.  It  rests  upon  subsuming  within  logic  programming,  instead  of  upon  extending 
it  with,  rewriting,  and  lazy  evaluation. 

A  non-terminating,  non-deterministic  rewrite  rule  system,  F*  and  a  reduction  strategy  for 
it,  select,  are  defined.  F*  is  shown  to  be  reduction-complete  in  that  select  simplifies  terms 
whenever  possible.  A  class  of  F*  programs  called  Deterministic  F*  is  defined  and  shown 
to  satisfy  confluence,  directedness,  and  minimality.  Confluence  ensures  that  every  term 
can  be  simplified  in  at  most  one  way.  Directedness  eliminates  searching  in  simplification 
of  terms.  Minimality  ensures  that  select  simplifies  terms  in  a  minimum  number  of  steps. 
Completeness  and  minimality  enable  select  to  exhibit,  respectively,  weak  and  strong 
forms  of  laziness. 


F*  can  be  compiled  into  Horn  clauses  in  such  a  way  that  when  SLD-resolution  interprets 
these,  it  directly  simulates  the  behavior  of  select.  Thus,  SLD-resolution  is  made  to 
exhibit  laziness.  LOG(F)  is  defined  to  be  a  logic  programming  system  augmented  with 
an  F*  compiler,  and  the  equality  axiom  X=X.  LOG(F)  can  be  used  to  do  lazy  functional 
programming  in  logic  ,  implement  useful  cases  of  the  rule  of  substitution  of  equals  for 
equals,  and  obtain  a  new  proof  of  confluence  for  combinatory  logic.  (  H 
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L  INTRODUCTION 


1.1  THE  PROBLEM 

Logic  programming  [Kowalski  1979],  is  the  use  of  statements  of  logic  as  computer 
programs.  It  has  led  to  new  insights  into  computing  as  well  as  logic.  Rewriting  is 
synonymous  with  reduction,  as  described,  for  example,  in  [Knuth  &  Bendix  1970].  It  is 
simplification  of  an  expression  by  successive  application  of  some  collection  of  rewrite 
rules.  Its  usefulness  is  evident  from  its  appearance  in  many  branches  of  mathematics. 

Lazy  evaluation,  e.g.  [Vuillemin  1974],  is  a  method  of  computing  which  ensures  that  a 
computation  step  is  performed  only  when  there  is  need  to  perform  it.  Thus,  not  only  does 
it  enable  certain  computations  to  terminate  more  quickly,  it  also  enables  computation  with 
infinite  data  structures. 

A  system  in  which  logic  programming,  rewriting,  and  lazy  evaluation  were  combined 
could  put  considerable  programming  power  at  our  disposal.  In  particular,  it  would 
simultaneously  afford  the  expressive  power  of  both  functions,  and  relations. 

Furthermore,  such  a  system  could  be  used  to  implement  instances  of  the  rule  of 
substitution  of  equals  for  equals  in  logical  statements.  This  is  a  very  important  rule,  as 
witness  its  use  in  the  simplest  of  mathematical  derivations,  e.g.  solution  of  trigonometric 
identities.  Logical  statements  could  be  expressed  using  logic  programs,  while  equality 
theories  could  be  expressed  using  rewrite  rules. 

We  propose  a  new  approach  for  building  the  above  system  which  is  rigorous,  as  well  as 
computationally  efficient.  It  rests  upon  subsuming  within  logic  programming,  instead  of 
extending  it  with,  rewriting  and  lazy  evaluation.  This  means  that  SLD-resolution,  the 
proof  procedure  used  for  logic  programming,  is  not  changed.  Instead,  Horn  clauses,  or 
pure  Prolog  clauses  are  written  in  such  a  way,  that  when  SLD-resolution  interprets  them, 
it  directly  simulates  lazy  rewriting.  The  resulting  system  is  called  LOG(F).  It  can  be  said 
to  make  contributions  to  the  following  three  areas: 

1.  Rewriting.  Simple,  syntactic  conditions  are  defined  under  which  non¬ 
terminating,  non-deterministic  rewrite  rules,  with  pattern  matching,  satisfy  useful 
computational  properties.  These  are  regarding  demand-driven  reduction, 
confluence,  elimination  of  search  during  reduction,  and  lengths  of  reductions. 

2.  Combination  of  logic  programming  and  rewriting.  It  is  shown  how 
rewriting  can  be  subsumed  within  logic  programming.  In  particular,  the  above 
computational  properties  of  rewriting  are  realized  within  logic  programming, 
without  changing  it,  and  without  sacrificing  logical  rigor.  This  has  two  important 
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consequences. 

First,  a  satisfactory  combination  of  logic  programming,  and  rewriting  is  achieved, 
without  developing  a  new  computational  model  of  which  the  two  are  instances. 
Developing  such  a  model  is  quite  difficult,  particularly  if  it  is  to  have  satisfactory 
declarative,  and  satisfactory  procedural  semantics.  Second,  full  advantage  is 
taken  of  the  very  efficient  implementations  of  Prologs.  Thus,  formidable 
problems  that  implementation  of  the  new  model  would  very  likely  pose,  are 
avoided. 

3.  Lazy  evaluation.  By  1  and  2,  it  is  shown  how  lazy  evaluation  can  be  done 
efficiently,  within  the  normally  eager  framework  of  logic  programming.  Thus  a 
basis  is  established  for  understanding  lazy  evaluation  purely  in  terms  of  well 
understood  ideas  in  first  order  logic.  Also,  a  new,  and  powerful  use  is  found  for 
an  old,  and  widely  used  tool,  namely,  Prolog. 

1.2  SUMMARY  OF  MAIN  RESULTS 

1.2.1  A  rewrite  rule  system  F* 

A  first-order  rewrite  rule  system  F*,  with  pattern  matching,  is  defined.  The  function 
symbols  are  partitioned  in  advance  into  constructors,  and  non-constructors. 

Simplification  in  F*  means  reducing  ground  terms  to  simplified  forms  i.e.  terms  of  the 
form  c(tl,..,tn)  where  c  is  a  constructor  symbol,  and  each  of  tl,..,tn  is  a  ground  term. 
Simplified  forms  can  be  used  to  represent  finite  approximations  to  infinite  structures,  and 
are  analogous  to  head-normal  forms  in  the  lambda-calculus  [Wadsworth  1976].  In 
contrast,  a  normal  form  is  defined  to  be  a  term  in  which  all  function  symbols  are 
constructors. 

Now,  an  important  point  is  that  a  method  for  computing  simplified  forms  can  be  used 
repeatedly  to  compute  normal  forms.  Moreover,  it  would  terminate  more  often  than 
would  a  method  which  directly  computes  normal  forms.  Hence  it  is  sufficient  to  develop, 
and  study  properties  of,  a  method  for  computing  simplified  forms. 

An  F*  program  is  a  finite  set  of  rules,  each  of  the  form  LHS=>RHS,  satisfying  the 
following  restrictions:  (1)  LHS  is  of  the  form  f(Ll,..,Lm),  m>=0,  f  a  non-constructor 
function  symbol,  and  each  Li  either  a  variable,  or  of  the  form  c(Tl,..,Tn),  n>=0,  c  a 
constructor  symbol,  and  each  Ti  a  variable,  (2)  a  variable  occurs  at  most  once  in  LHS, 
and  (3)  all  variables  of  RHS  occur  in  LHS.  Note  that  non- terminating,  non-deterministic 
sets  of  rewrite  rules  are  permissible.  Also  any  rule  with  left  hand  side  of  depth  greater 
than  two  can  easily  be  expressed  in  terms  of  rules  with  left  hand  sides  of  depth  at  most 
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two,  as  required  by  (1). 

Where  P  is  an  F*  program,  and  f(Tl,..,Tn)  a  ground  term,  a  reduction  strategy  for  P, 
selectp,  is  defined  by  the  following  pseudo-Horn  clauses: 

selectp(f(Tl,..,Tn),f(Tl,..,Tn))  if  f(Tl,..,Tn)=>pX. 
selectp(f(Tl,..,Ti,..,Tn),X)  if 

there  is  a  rule  f(Ll,..JLi,..,Ln)=>RH$  in  P,  and 
there  is  no  substitution  a  such  that  Ti=Lia,  and 
selectp(Ti.X). 

Here  A=>B  means  there  is  a  rule  LHS=>RHS  such  that  A  matches  LHS  with  substitution 
cr,  and  B  is  RHSa.  Select  is  shown  to  be  reduction-complete,  in  that  if  a  term  can  be 
simplified,  it  can  be  simplified  by  reducing  it  via  select.  Thus  select  exhibits  a  weak  form 
of  laziness.  An  example  of  an  F*  program  is: 

peim([])=>[]. 

perm([AIV])=>insert(U,perm(V)). 

insert(U,X)=>[UIX]. 

insert(U,[AIB])=>[Alinsert(U,B)]. 

Here  [],  I  are  constructors,  while  perm,  insert  are  non-constructors.  The  term 
perm([l,2,3])  is  now  reduced  by  select  to  [llperm([2,3])],  [2linsert(l,perm([3]))],  and 
[3linsert(l,insert(2,perm([])))].  If  further  reduction  is  desired,  select  may  be  called 
recursively  upon  the  arguments  of  I  to  yield  each  of  [1,2,3],  [1,3,2],  [2,3,1],  [2,1,3], 
[3,1,2],  [3,2,1]. 

1.2.2  Deterministic  F* 

An  F*  program  P  is  a  DF*  program  if  (1)  left  hand  sides  of  no  two  rules  in  P  unify,  and 
(2)  where  f(Ll,..,Li,..,Lm)=>RHS  is  a  rule  in  P,  and  Li  is  not  a  variable,  then  in  every 
other  rule  f(Kl,..,Ki,..,Km)=>RHSl  in  P,  Ki  is  not  a  variable.  These  restrictions  are  very 
reasonable,  and  as  examples  throughout  this  paper  show,  it  is  possible  to  adhere  to  these, 
yet  write  quite  expressive  programs. 

DF*  is  shown  to  satisfy  confluence,  directedness,  and  minimality.  Confluence  ensures 
that  a  term  can  be  simplified  in  at  most  one  way.  Directedness  ensures  that  to  simplify  a 
term  it  is  sufficient  to  compute  any  reduction  computable  by  select.  Moreover,  all 
reductions  computable  by  select  are  of  equal  length.  Thus,  during  reduction,  no 
searching  is  necessary.  Provided,  whenever  a  term  is  reduced,  all  copies  of  it  are 
simultaneously  reduced,  minimality  ensures  that  select  simplifies  terms  in  a  minimum 
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number  of  steps.  Thus,  select  exhibits  a  strong  form  of  laziness.  An  example  of  a  DF* 
program  is: 

append([],X)=>X. 

append([UI  V] ,  W)=>[Ulappend(  V,W)] . 

interleave([UIV],X)=>[Ulinterleave(X,V)]. 

a=>[lla]. 

b=>[2lb]. 

However,  the  F*  program  above  to  insert  an  element  non-deterministically  into  a  list  is 
not  in  DF*. 

1.2.3  Compiling  F*  into  Horn  clauses 

F*  programs  can  be  compiled  into  Horn  clauses  in  such  a  way  that  when  SLD-resolution 
interprets  these,  it  directly  simulates  the  behavior  of  (the  interpreter  based  upon)  select. 
This  means  that  there  is,  essentially,  a  one-to-one  correspondence  between  steps  executed 
by  (the  interpreter  based  upon)  select,  and  steps  executed  by  SLD-resolution.  This  is 
accomplished  by  translating  each  F*  rule  into  a  distinct  Horn  clause,  and  simultaneously 
embodying  in  that  clause,  information  about  the  logic  of  the  rule,  as  well  as  information 
about  the  control  of  select  when  interpreting  that  rule. 

Thus,  SLD-resolution  is  made  to  exhibit  laziness.  If  the  F*  program  is  also  in  DF*, 
clauses  can  be  further  transformed  to  eliminate  all  backtracking.  Finally,  clauses  can  be 
compiled  into  machine  code  by  Prolog  compilers.  The  compilation  algorithm  consists  of 
two  steps: 

Step  1.  For  each  n-ary,  n>=0,  constructor  symbol  c  in  P,  and  where  Xl,..,Xn  are  distinct 
variables,  generate  the  clause: 

reduce(c(X  1  ,..,Xn),c(X  1  ,..,Xn)) 

Step  2.  Let  f(Ll,..,Lm)=>RHS  be  a  rule  in  P.  Let  Al,..,Am,Out  be  distinct  Prolog 
variables  not  occurring  in  the  rule.  If  Li  is  a  variable  let  Qi  be  Ai=Li.  If  Li  is  c(Xl,..,Xn) 
where  c  is  a  constructor  symbol,  and  each  Xi  a  variable,  let  Qi  be  reduce(Ai,c(X  1  ,..,Xn)). 
Generate  the  clause: 

reduce(f(Al,..,Am),Out):-Ql,..,Qm,reduce(RHS,Out). 

In  practice,  if  Li  is  a  variable,  Qi  can  be  dropped,  provided  Ai  is  replaced  by  Li  in 
f(Al,..,Am).  For  example,  the  above  F*,  and  DF*  programs  are  compiled  into: 
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reduce([],[]). 

reduce([UIV],[UIV]). 

reduce(perm(X),Z):-reduce(X,[]),reduce([],Z). 

reduce(perm(X),Z):-reduce(X,[FXIRX}),reduce(insert(FX,perm(RX)),Z). 

reduce(insert(A,X),Z):-reduce([AIX],Z). 

reduce(insert(A,X),Z):-reduce(X,[FXIRX]),reduce([FXIinsert(A,RX)],Z). 

reduce(append(X,Y),Z):-reduce(X,[]),reduce(Y,Z). 

reduce(append(X,Y),Z):-reduce(X,[UIV]),reduce([Ulappend(V,Y)],Z). 

reduce(interleave(X,Y),Z):-reduce(X,[UIV]),reduce([Ulinterleave(Y,V)],Z). 

reduce(a,Z):-reduce([lla],Z). 

reduce(b,Z):-reduce([2lb],Z). 

If  we  now  type,  in  Prolog,  reduce(perm([l,2,3]),Z),  we  obtain  Z=[llperm([2,3])], 
Z=[2linsert(l,perm([3]))],  and  Z=[3linsert(l,insert(2,perm([])))].  Note  the  following: 
First,  perm([l,2,3])  is  only  partially  reduced,  and  directly  by  Prolog,  not  by  some  lazy 
interpreter  implemented  in  Prolog.  Second,  the  terms  to  which  Z  is  bound  are  exactly 
those  to  which  perm([l,2,3])  is  reduced  by  select.  This  illustrates  Prolog  simulating 
behavior  of  select.  If  we  now  define: 

first(0,X,[]). 

first(N,X,[FXIZ]);-not(N=0)jeduce(X,[FXIRX]),Nl  is  N-l,first(Nl,RX,Z). 
make_list(E,[]):-reduce(E,[]). 

make_list(E,[FEIZ]):-reduce(E,[FElRE]),make_list(RE,Z). 

print_list(X):-reduce(X,rFXIRX]),write(FX),write(\’),print_list(RX). 

and  then  type,  make_list(perm([l,2,3]),Z),  we  obtain  Z=[  1,2,3],  Z=[l,3,2],  Z=[2,l,3], 
Z=[2,3,l],  Z=[3,l,2],  Z=[3,2,l].  As  further  examples,  if  we  type  the  queries  on  the  left- 
hand  side,  we  obtain  the  answers  on  the  right-hand  side: 

reduce(append(a,b),Z)  — >  Z=[llappend(a,b)] 
reduce(interleave(a,b),Z)  — >  Z=[llinterleave(b,a)] 
first(5,interleave(a,b),Z)  — >  Z=[l,2,l,2,l] 
print_list(interleave(a,b))  — >  1,2, 1,2, 1,2, . 

1.2.4  LOG(F) 

LOG(F)  is  defined  to  be  a  logic  programming  system  augmented  with  an  F*  compiler, 
and  the  equality  axiom  X=X.  The  result  of  compilation  is  to  add  to  a  logic  programming 
system,  a  primitive  for  lazily  simplifying  F*  terms.  This  primitive  can  be  called  from 
other  Horn  clauses,  so  LOG(F)  is  proposed  as  a  combination  of  logic  programming. 
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rewriting  and  lazy  evaluation. 

For  problems  for  which  lazy  evaluation  does  not  reduce  lengths  of  computation,  e.g. 
sorting,  or  all  permutations,  LOG(F)  is  empirically  found  to  be  about  five  times  slower 
than  Prolog.  For  problems  for  which  lazy  evaluation  does  reduce  lengths  of  computation, 
e.g.  N-queens,  LOG(F)  is  faster  than  Prolog  by  unbounded,  even  infinite,  amounts. 

In  the  literature,  [Vuillemin  1974,  Berry  &  Levy  1979],  optimality  is  used  synonymously 
with  minimality.  Due  to  minimality  of  DF*,  LOG(F)  can  also  be  said  to  be  optimal.  It 
can  also  be  said  to  be  so  in  a  weaker  sense,  because  of  its  desirable  computational 
properties,  and  their  economical  realization  in  Prolog. 

1.2.5  Applications  of  LOG(F) 

LOG(F)  can  be  used  to  do  lazy  functional  programming  in  logic.  In  particular,  it  can  be 
used  to  manipulate  representations  of  infinite  structures,  such  as  in  real  analysis,  exact 
real  arithmetic,  graphics,  or  networks  of  communicating  processes. 

The  SKI  rules  of  combinatory  logic  can  be  expressed  as  a  DF*  program.  From 
confluence  of  DF*.  a  new  proof  is  obtained  of  the  confluence  of  combinatory  logic. 

DF*  seems  to  offer  a  reasonable  compromise  between  sequential  execution  and 
unbounded  parallelism.  Due  to  directedness  of  DF*,  arguments  of  f  in  f(tl,..,tm)  can  be 
simplified  in  parallel,  however,  they  would  be  simplified  lazily.  Thus,  DF*  seems  to  be  a 
good  candidate  for  implementation  on  parallel  machines. 

Finally,  if  a  DF*  program  is  interpreted  as  an  equality  theory,  reduce  clauses  can  be 
thought  of  as  implementing  an  equality  theory  in  Prolog  with  the  restriction  that  it  be  used 
only  for  simplification  of  terms.  Now,  given  a  clause  of  the  form  p(c(Xl,..,Xm)):-Body, 
where  c  is  a  constructor  symbol,  we  can  add  another  clause  stating  a  rule  of  substitution 
of  equals: 

p(X):-reduce(X,c(X  1 ,.  .,Xm)),p(c(X  1 ,..  ,Xm)). 

Now,  even  when  a  term  E  is  not  of  the  form  c(Xl,..,Xm),  p  can  still  be  inferred  for  E, 
provided  E  is  reducible  to  a  term  of  the  form  c(Xl,..,Xm).  For  example,  with  the  Prolog 
rule  for  computing  perimeters  of  regular  polygons,  peri(reg_poly(N,S),Z):-Z  is  N*S,  we 
can  infer  peri(reg_poly(3,10),30).  We  can  now  add  the  clause: 


peri(X,Y):-reduce(X,Z),peri(Z,Y). 
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Where  reg_poly  is  a  constructor,  and  equi,  square,  and  hexagon  are  non-constructors,  an 
equality  theory  among  polygons,  expressed  in  F*,  is: 

equi(S)=>reg_poly(3,S). 

square(S)=>reg_poly(4,S). 

hexagon(S)=>reg_poly(6,S). 

This  is  compiled  into: 

reduce(reg_poly(A,B),reg_poly(A,B)). 

reduce(equi(S),Z):-reduce(reg_poly(3,S),Z). 

reduce(square(S),Z):-reduce(reg_poly(4,S),Z). 

reduce(hexagon(S),Z):-reduce(reg_poly(6,S),Z). 

The  Prolog  query  peri(equi(10),30),  now  succeeds.  Thus  Prolog  automatically  infers  the 
result  of  substituting  equi(10)  for  reg_poly(3,10),  in  peri(reg_poly(3,10),30).  Of  course, 
if  we  type  peri(square(3),Z),  we  obtain  Z=12. 

1.3  RELATIONSHIP  WITH  PREVIOUS  WORK 

There  seem  to  be  two  major  approaches  to  combining  logic  programming,  and  rewriting. 
The  first  consists  of  implementing  logic  programming  in  rewriting,  e.g.  LOGLISP 
[Robinson  &  Sibert  1982],  or  QLOG  [Komorowski  1982].  However,  it  seems  difficult 
for  such  an  approach  to  lead  to  an  efficient  system  since  logic  programs  must  pass 
through  two  high-level  layers  of  interpretation. 

The  second  approach  consists  of  developing  a  new  computational  model  of  which  both 
rewriting,  and  logic  programming  are  instances.  Examples  of  such  models  include  those 
based  upon  upon  semantic-  or  T-unification,  [Goguen  &  Meseguer  1986], 
[Subrahmanyam  &  You  1984],  [Komfeld  1983],  sets,  [Robinson  1987],  [Darlington  et  al. 
1986],  narrowing,  [Reddy  1985],  the  Knuth-Bendix  completion  procedure,  [Dershowitz 
&  Josephson  1984],  oriented  equational  clauses,  [Fribourg  1984],  residuation,  [Ait-Kaci 
&  Nasr  1987],  extension  of  SLD-resolution  with  narrowing,  [Yamamoto  1987],  or 
extension  of  SLD-resolution  with  atom-elimination  rule,  [Barbuti  et  al.  1986]. 

In  order  for  a  new  computational  model  to  be  satisfactory,  it  must  posssess  not  only  good 
declarative  semantics,  but  also  good  procedural  semantics.  The  former  is  essential  for 
reasoning  about  programs  in  the  model.  The  latter  means  that  the  behavior  of  the  model 
is  simple  enough  that  it  can  be  visualized,  predicted,  and  controlled.  It  is  essential  if  the 
model  is  to  be  used  for  programming,  i.e.  for  expressing  algorithms.  In  this  regard,  we 
also  quote  Robinson  [1984]: 
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...one  guiding  principle  must  surely  be  that  logic  programming,  however  narrowly  or  broadly 
construed,  essentially  involves  the  ingredient  of  practicality.  The  underlying  deductive  processes 
should  have  enough  directness  and  predictability  to  permit  the  planning  of  efficient  logical 
computations.  Herein  probably  lies  the  important  distinction,  difficult  to  make  precise  but 
nonetheless  real,  between  logic  programming  proper  and  automatic  deduction  in  general. 

However,  developing  a  satisfactory  computational  model,  more  general  than  logic 
programming  and  rewriting,  is  a  very  ambitious  undertaking,  particularly  if  the  model  is 
also  to  exhibit  laziness.  In  particular,  it  appears  that  each  of  the  above  models,  with  the 
possible  exception  of  [Robinson  1987],  and  [Darlington  et  al.  1986],  either  has  complex 
declarative  semantics,  or  complex  procedural  semantics. 

Of  course,  even  if  a  satisfactory  computational  model  is  developed,  its  efficient 
implementation  on  concrete  machines  can  still  pose  a  considerable  software  engineering 
challenge,  requiring  several  person-years  of  effort.  In  particular,  it  appears  that  efficient 
implementation  of  the  above  proposals  is  still  an  ongoing  effort. 

Lazy  evaluation  itself  does  not  seem  to  be  easy  to  implement  efficiently.  Several 
implementations  of  lazy  evaluation  for  functional,  and  logic-based  languages  have  been 
proposed  e.g.  [Friedman  &  Wise  1976,  Henderson  1980,  Turner  1979,  O’Donnell  1985, 
Clark  &  McCabe  1979,  Hansson  et  al.  1982,  Shapiro  1983,  Barbuti  et  al.  1986]. 

However,  only  a  few  of  these  systems,  e.g.  Turner’s,  or  O’Donnell’s,  seem  to  be  efficient 
enough  for  practical  programming. 

In  view  of  such  difficulties  with  developing,  and  implementing  a  new  computational 
model  of  which  logic  piogramming,  rewriting,  and  lazy  evaluation,  are  instances,  we  ask 
whether  it  is  possible  to  subsume  the  last  two  within  the  first.  In  other  words,  we  ask 
whether  it  is  possible  to  keep  SLD-resolution  fixed,  but  use  it  in  such  a  way  that  it 
performs,  in  a  computationally  feasible  manner,  rewriting,  and  lazy  evaluation?  If  such 
an  attempt  were  to  succeed,  we  would  not  only  obtain  a  declarative  semantics  of 
rewriting,  and  lazy  evaluation  using  purely  logical  ideas,  we  would  also  have  a  very 
efficient  implementation  of  these,  in,  say,  Prolog. 

Important  precedents  in  this  direction  have  already  been  established  with  the 
subsumption,  within  logic  programming,  of  grammars,  and  relational  databases.  Definite 
clause  grammar  rules  [Pereira  &  Warren  1980]  can  be  expressed  as  Horn  clauses  in  such 
a  way  that  their  interpretation  using  Prolog,  directly  simulates  top-down  parsing. 
Relational  databases  can  be  expressed  directly  as  ground  Horn  clauses  [Gallaire  & 

Minker  1978].  Prolog  enables  inference  with  them  in  ways  (e.g.  using  recursion)  not 
possible  with  conventional  data  retrieval  operators. 
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An  important  step  towards  subsuming  rewriting  within  logic  programming,  has  recently 
been  taken  by  van  Emden  &  Yukawa  [1987],  whose  motivations  are  very  similar  to,  but 
independent,  of  ours.  They  show  how  to  derive  logical  consequences  of  the  standard 
equality  axioms  which  result  in  a  small  SLD-search  space.  They  also  show  how  to 
compile  an  equality  theory  into  equality  free  Horn  clauses,  which  also  result  in  a  small 
SLD-search  space.  However,  their  approach  is  restricted  only  to  terminating  equality 
theories.  These  are  insufficient  for  representing  infinite  structures. 

As  pointed  out  in  [Narain  1986],  the  compactness  theorem  of  first  order  logic  [Robinson 
1979]  suggests  that  lazy  evaluation  is  already  present  in  first  order  logic.  It  states  that  if 
an  infinite  set  of  clauses  is  unsatisfiable  then  it  has  a  finite  subset  which  is  also 
unsatisfiable.  Moreover,  a  complete  proof  procedure,  such  as  SLD-resolution  for  Horn 
clauses  would  find  this  set  in  finite  time.  Thus,  as  with  lazy  evaluation,  one  could  get 
termination  in  finite  time  even  with  an  infinite  input. 

This  idea  was  investigated  further,  and  led  to  a  method  in  [Narain  1986],  for  defining 
functions  by  Horn  clauses  in  such  a  way  that  when  SLD-resolution  interprets  these,  it 
behaves  lazily.  However,  the  discussion  is  limited  mainly  to  lists,  although  a 
generalization  to  other  data  structures  is  hinted. 

The  current  system,  LOG(F),  is  an  attempt  to  generalize,  and  develop  a  purely  syntactic 
explanation  of  the  above  method.  It  appears  to  subsume  within  logic  programming,  in  a 
rigorous  yet  computationally  efficient  fashion,  non-terminating,  non-deterministic 
rewriting,  and  lazy  evaluation. 

Minimality  of  DF*  appears  to  be  a  generalization  of  similar  results  by  Vuillemin  [1974], 
and  Berry  &  Levy  [1979].  Both  derive  it  only  for  rewrite  rules  whose  left  hand  sides  are 
of  the  form  f(Xl,..,Xm),  where  each  Xi  is  a  variable.  Thus,  they  must  assume  existence 
of  a  finite  number  of  primitive  functions  such  as  if-then-else,  which  are  not  definable 
using  such  rules  alone.  In  contrast,  F*  admits  rewrite  rules  in  which  the  Xi  can  be 
patterns.  Thus,  in  F*,  as  in  logic  programming,  it  is  not  necessary  to  assume  existence  of 
any  primitive  functions. 

Restrictions  on  rewrite  rules  in  F*,  and  the  reduction  strategy  select,  seem  to  be 
substantially  simpler  than  their  counterparts  in  the  system  of  O’Donnell  [1985].  Select 
also  seems  to  be  substantially  simpler  than  its  counterpart  in  the  system  of  Huet  &  Levy 
[1979].  Furthermore,  since  F*  can  be  compiled  into  efficient  Horn  clauses,  and  Prolog 
can  be  used,  implementation  of  F*  is  straightforward.  However,  implementation  of  the 
other  two  systems  seems  to  be  quite  a  major  undertaking. 

Confluence  of  DF*  is  anticipated  by  Huet  [1980]  who  derives  sufficient  conditions  for 
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confluence  for  rewrite  rule  systems  more  general  than  DF*.  However,  our  proof,  being 
specialized  for  DF*,  is  very  simple. 

LOG(F)  bears  only  a  superficial  similarity  to  the  system  of  Tamaki  [1984].  He  also 
shows  how  to  compile  equality  theories  into  Horn  clauses  with  a  smaller  search  space. 
However,  as  is  pointed  out  in  his  paper,  these  clauses  can  still  be  seriously  inefficient, 
particularly,  when  manipulating  representations  of  infinite  structures.  As  shown  in 
Section  VI.8,  such  inefficiency  is  not  exhibited  by  LOG(F).  Also,  his  reducibility 
predicate  can  terminate  even  without  simplifying  terms  whereas  that  of  LOG(F)  cannot. 

1.4  OUTLINE  OF  PAPER 

Section  II  defines  F*,  and  the  reduction  strategy  select,  and  establishes  its  reduction- 
completeness.  Section  HI  defines  DF*,  and  shows  its  confluence  and  directedness. 
Section  IV  defines  Labeled  DF*,  a  subset  of  DF*,  for  the  purpose  of  formalizing  the 
notion  of  a  copy  of  a  term,  and  then  establishes  its  minimality.  Section  V  describes  an 
algorithm  for  compiling  F*  into  Horn  clauses,  and  proves  its  correctness.  Section  VI 
describes  examples  of  programming  in  LOG(F),  and  compares  performance  of  LOG(F) 
with  that  of  Prolog.  Section  VII  contains  a  summary  and  conclusions.  Proofs  of 
relatively  minor  propositions  have  been  omitted,  or  abbreviated.  These  can  be  obtained 
in  full  in  [Narain  1988]. 
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II.  1  INTRODUCTION 

A  first  order,  non-determinisdc,  non-terminating  rewrite  rule  system  F*,  and  a  lazy 
reduction  strategy  for  it,  select,  are  defined.  The  emphasis  in  F*  is  on  computing 
simplified  forms,  instead  of  normal  forms.  Thus,  termination  problems  faced  by  certain 
previous  approaches  are  avoided. 

The  main  result  proved  is  that  F*  is  reduction-complete,  in  that  select  reduces  ground 
terms  to  their  simplified,  or  normal  forms,  whenever  possible.  Reduction-completeness 
yields  a  weak  form  of  laziness.  A  term  may  denote  an  infinite  object,  and  so  fail  to  have 
a  finite  normal  form.  However,  if  it  has  a  finite  simplified  form,  it  is  obtained  in  finite 
time.  By  repeatedly  simplifying  subterms  of  this  simplified  form,  the  structure  of  the 
infinite  object  can  be  revealed  to  any  arbitrary  depth. 

n.2  DEFINITION  OF  F* 

Variables.  There  is  a  countably  infinite  list  of  variables. 

Function  symbols.  There  is  a  countably  infinite  list  of  0-ary  function  symbols.  In 
particular,  [],  0,  true,  false,  are  0-ary  function  symbols.  There  is  a  countably  infinite  list  of 
1-ary  function  symbols.  In  particular,  s  is  a  1-ary  function  symbol.  There  is  a  countably 
infinite  list  of  2-ary  function  symbols.  In  particular,  I  is  a  2-ary  function  symbol.  And  so 
on,  for  all  other  arities. 

Connectives.  The  connectives  are  =>,  (, ), 

Constructor  Symbols.  There  is  an  infinite  subset  of  the  function  symbols  called 
Constructors.  Each  element  of  Constructors  is  called  a  constructor  symbol.  For  each  n, 
n>=0.  Constructors  contains  an  infinite  number  of  n-ary  function  symbols.  In  particular, 

0,  true,  false,  []  and  I  are  constructor  symbols.  It  is  intended  that  data  be  represented  by 
combinations  of  only  constructor  symbols. 

Terms.  A  term  is  either  a  variable,  or  an  expression  of  the  form  f(tl,..tn)  where  f  is  an  n- 
ary  function  symbol,  n>=0,  and  each  ti  is  a  term.  A  term  is  called  ground  if  it  contains  no 
variables.  It  is  the  intention  in  F*  to  reduce  only  ground  terms,  and  most  of  the 
propositions  below  are  about  these.  Non-ground  terms  such  as  left  hand  sides  of 
reduction  rules  do  arise,  but  in  very  few  propositions.  Hence,  unless  explicitly  stated 
otherwise,  by  a  term  is  meant  a  ground  term. 
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Subterms.  Let  E  be  a  term.  Then  E  is  said  to  be  a  subterm  of  itself.  Also,  if  E=f(tl,..,tn), 
n>0,  then  X  is  said  to  be  a  subterm  of  E,  if  X  is  a  subterm  of  some  ti.  Let  X  be  a  subterm 
of  E.  Then  X  is  said  to  occur  in  E.  Also,  if  X*E,  then  X  is  said  to  be  a  proper  subterm  of 
E,  or  be  properly  contained  in  E.  Two  subterms  A  and  B  of  E  are  said  to  overlap,  if  A  is 
properly  contained  within  B. 

Substitutions.  A  substitution  is  a,  possibly  empty,  set  {<Xl,tl>,..,<Xn,tn>}  where  the 
Xl,..,Xn  are  distinct  variables,  and  each  ti  is  a  term,  possibly  containing  variables.  A 
variable  X  is  defined  in  a  substitution  a  iff  for  some  possibly  non-ground  term  s,  <X,s> 
occurs  in  a.  In  this  paper,  we  will  be  concerned  almost  exclusively  with  substitutions  in 
which  for  each  pair  <X,s>,  s  is  a  ground  term. 

Applying  substitutions  to  terms.  Let  o={<Xl,tl>,..,<Xn,tn>}  be  a  substitution  and  E  be 
a  term,  possibly  containing  variables.  The  result  of  applying  o  to  E,  Ecr,  is  the  result  of 
replacing,  for  each  i,  every  occurrence  of  Xi  in  E  by  ti. 

Matching.  A  ground  term  E  is  said  to  match  a  possibly  non-ground  term  F,  with 
substitution  a,  if  E=Fa. 

Unification.  Two  terms,  E  and  F,  possibly  containing  variables,  are  said  to  unify  with 
substitution  a  if  Ea=Fc.  Note  that  matching  is  a  special  case  of  unification. 

Reduction  Rules.  A  reduction  rule  is  of  the  form: 

LHS=>RHS 

where  LHS  and  RHS  are  terms,  possibly  containing  variables.  LHS  is  called  the  head  of 
the  rule.  The  following  restrictions  are  placed  on  LHS  and  RHS: 

(a)  LHS  is  not  a  variable. 

(b)  LHS  is  not  of  the  form  c(tl,..,tn)  where  c  is  a  constructor  symbol. 

(c)  If  LHS=f(tl,t2,..,tn),  then  each  ti  is  a  variable,  or  a  term  of  the  form 
c(Xl,..,Xm)  where  c  is  an  m-ary  constructor  symbol,  and  each  Xi  a  variable. 

(d)  There  is  at  most  one  occurrence  of  any  variable  in  LHS. 

(e)  All  variables  of  RHS  appear  in  LHS. 

These  restrictions  are  very  reasonable,  and  as  examples  throughout  the  paper  show,  very 
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expressive  programs  can  be  written  adhering  to  these.  Note  that  F*  is  more  expressive 
than  first  order  Lisp,  as  the  latter  does  not  admit  patterns  in  left  hand  sides  of  function 
definitions. 

Restriction  (a)  is  to  enable  functional  programs  to  be  written  in  F*. 

Restriction  (b)  ensures  that  a  term  of  the  form  c(tl,..,tn),  c  a  constructor  symbol,  cannot 
be  reduced  as  a  whole.  This  yields  a  simple  halting  condition  for  the  basic  simplification 
process.  If  further  simplification  is  required,  the  process  may  be  called  recursively. 

Restriction  (c)  limits  heads  of  rules  to  be  of  depth  at  most  two,  and  so  greatly  simplifies 
analysis.  However,  no  generality  is  lost,  since  rules  with  heads  of  arbitrary  depth  can 
easily  be  expressed  in  terms  of  rules  with  heads  of  depth  at  most  two.  For  example,  the 
rule: 


fib(s(s(X)))=>plus(fib(X),fib(s(X))) 

can  be  expressed  as: 

fib(s(A))=>g(A) 

g(s(X))=>plus(fib(X),fib(s(X)). 

Restriction  (d)  is  the  linearity  assumption.  It  ensures  that  to  match  a  ground  term 
f(tl,..,tn)  with  the  left  hand  side  of  a  rule  f(Ll,..,Ln),  it  is  sufficient  to  match,  for  each  i,  ti 
with  Li. 

Restriction  (e)  ensures  that  a  ground  term  is  never  reduced  to  a  non-ground  term.  Again, 
this  is  necessary  if  F*  is  to  be  used  for  functional  programming. 

F*  programs.  An  F*  program  is  a  finite  set  of  reduction  rules.  Where  t  is  a  binary 
constructor  symbol,  some  examples  of  F*  programs  are: 

quicksort([])=>[]. 

quicksort([AIB])=>quicksortl(A,partition(A,B,[],[])). 

quicksortl(A,t(L,R))=>append(quicksort(L),[Alquicksort(R)]). 

partition(U,[]JL,R)=>t(L,R). 
parti  tion(U,[AIB],L,R)=> 

if(lesseq(A,U),partition(U,B,  [AIL],  R),  parti  tion(U,B,L,[AIR])). 


append([],X)=>X 
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append([UIV],W)=>[Ulappend(V,W)] 

if(true,X,Y)=>X. 

if(false,X,Y)=>Y. 

lesseq(0,X)=>true. 

lesseq(s(X),s(Y))=>lesseq(X,Y). 

lesseq(s(X),0)=>false. 

zero(X)=>0. 

prim_rec_f(0,Y  1  ,Y2,  Y3)=>g(Y  1 ,  Y2,  Y3). 

prim_rec_f(s(X),  Y 1  ,Y2,Y3)=>h(prim_rec_f(X,Y  1  ,Y2,Y3),X,Y  1 ,  Y  2,Y3). 
minim_p(X,K)=>if(equal(p(X),K),X,minim_p(s(X),K)). 

equal(0,0)=>truc. 

equaI(0,s(X))=>falsc. 

equal(s(X),0)=>false. 

equal(s(X),s(Y))=>equal(X,Y). 

merge([AIB],[CID])=>ifGesseq(A,C),[Almerge(B,[CID]),[Clmerge([AIB]4))]). 

int(N)=>[Nlint(s(N))] . 

greater(X>Y)=>not(lesseq(X,Y)). 

not(true)=>false. 

not(falsc)=>true. 

We  now  consider  the  reduction  of  terms.  Again,  unless  explicitly  stated,  by  a  term  we 
mean  a  ground  term. 

E=>pEI.  Let  P  be  an  F*  program  and  E  and  El  be  terms.  We  say  E=>pEl  if  there  is  a 
rule  LHS=>RHS  in  P,  and  a  substitution  o  such  that  E=LHSa,  and  El=RHSa.  We  also 
say  that  E  reduces  to  El  by  the  rule  LHS=>RHS,  or  that  the  rule  applies  to  the  whole  of 
E.  The  subscript  on  =>  is  dropped,  if  clear  from  context. 

F=E[G/H].  Where  E,F,G,H,  are  terms,  let  F  be  the  result  of  replacing  an  occurrence  of  G 
in  E  by  H.  Then  we  say  F=E[G/H], 

E->pEl,E-*>pEl.  Let  P  be  an  F*  program  and  E  be  a  term.  Let  G  be  a  subterm  of  E 
such  that  G=>pH.  Let  El  be  the  result  of  substituting  H  for  G  in  E.  Then  we  say  that  E- 
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>pEl .  Note  that  if  E=>pEl  then  E  matches  the  left  hand  side  of  some  rule  in  P.  If  E- 
>pE  1  then  some  subterm  of  E,  including  possibly  E,  matches  the  left  hand  side  of  some 
rule  in  P.  We  define  -*>p  to  be  the  reflexive  transitive  closure  of  ->p.  Again,  the  subscript 
on  ->  or  -*>  is  dropped,  if  clear  from  context. 

Reductions.  Let  P  be  an  F*  program.  A  reduction  in  P  is  a,  possibly  infinite,  sequence 
E1,E2,...  such  that  for  each  i,  when  Ei  and  Ei+1  both  exist,  Ei->pEi+l. 

Lengths  of  reductions.  The  length  of  a  finite  reduction  E0,El,..,En  is  n. 

Simplified  forms.  A  term  is  said  to  be  in  simplified  form  or  simplified  if  it  is  of  the  form 
c(tl,..,tn)  where  c  is  an  n-ary  constructor  symbol,  n>=0,  and  each  ti  is  a  term.  F  is  called  a 
simplified  form  of  E,  if  E-*>F  and  F  is  in  simplified  form. 

Normal  forms.  A  term  is  said  to  be  in  normal  form  if  each  function  symbol  in  it  is  a 
constructor  symbol.  F  is  called  a  normal  form  of  E  if  E-*>F  and  F  is  in  normal  form. 

Successful  reductions.  Let  P  be  an  F*  program.  A  successful  reduction  in  P  is  a  finite 
reduction  E0,..JEn,  n>=0,  in  P,  such  that  En  is  simplified. 

Rp(G,H,A,B)-  Let  P  be  an  F*  program.  Where  GJH,A,B  are  terms,  Rp(G,H,A,B)  if  (a) 
G=>H,  and  (b)  B  is  identical  with  A  except  that  zero  or  more  occurrences  of  G  in  A  are 
simultaneously  replaced  by  H.  Note  that  A  and  G  can  be  identical.  Again,  if  P  is  clear 
from  context  we  omit  the  subscript  on  R. 

Reduction  strategy.  Let  P  be  an  F*  program.  A  reduction  strategy  for  P  takes  as  input  a 
term  E  and  selects  a  subterm  G  of  E  such  that  there  exists  a  term  H  such  that  G=>pH. 

A  special  reduction  strategy.  Let  P  be  an  F*  program.  We  now  define  a  reduction 
strategy,  selectp  for  P.  Informally,  given  a  term  E  it  will  select  that  subterm  of  E  whose 
reduction  is  necessary  in  order  that  some  =>  rule  in  P  apply  to  the  whole  of  E.  Where 
f(Tl,..,Tn)  is  a  term,  the  relation  selectp  is  defined  by  the  following  pseudo-Horn  clauses: 

selectp(f(Tl,..,Tn),f(Tl,..,Tn))  if  f(Tl,..,Tn)=>pX. 
selectp(f(Tl,..,Ti,..,Tn),X)  if 

there  is  a  rule  f(Ll,..,Li,..,Ln)=>RHS  in  P,  and 
there  is  no  substitution  a  such  that  Ti=Licr,  and 
selectp(Ti.X). 

The  second  rule  is  a  schema,  so  that  an  instance  of  it  is  assumed  written  for  each  each  i, 
l=<i=<n.  Again,  the  subscript  on  select  is  dropped,  if  clear  from  context.  Note  the 
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following: 

(1)  When  selectp  takes  as  input  E  and  returns  G,  it  also,  implicitly,  computes  a 
position,  or  occurrence  of  G  in  E.  This  occurrence  can  be  obtained  from  the  proof 
of  selectp(E,G). 

(2)  If  selectp(E,G),  there  is  a  term  H  such  that  G=>pH. 

(3)  Select  is  non-deterministic,  in  that  given  term  E,  it  is  possible  for  it  to  select 
more  than  one  subterm  Al,..,Ak,  k>0,  within  E.  Also,  it  is  possible  that  for  some 
i,j,  i*j,  Ai  is  a  proper  subterm  of  Aj. 

(4)  Since,  by  restriction  (b)  there  is  no  rule  in  P  of  the  form  c(tl,..,tn)=>RHS, 
where  c  is  a  constructor  symbol,  if  E  is  simplified,  selectp  is  undefined  for  E. 

For  example,  where  P  is  the  set  of  reduction  rules  which  appear  above,  and  1,2,...  are 
abbreviations,  respectively,  for  s(0),s(s(0)),..,  we  have  the  following: 

select(merge(int(l),int(2)),int(l)). 
select(merge(int(  1  ),int(2)),int(2)). 
select(merge([l,3],int(2)),int(2)). 
select(merge([l,2],[3,4]),merge([l,2],[3,4])). 

If  E=[llmerge(int(l),int(2))]  then  select  is  undefined  for  E. 
select(lesseq(0,zero(  1  )),lesseq(0,zero(  1 ))). 
select(lesseq(0,zero(  1  )),zero(  1 )). 

Let  E,GJH  be  terms.  In  the  following,  when  we  say  that  select(E,G),  and  G  is  to  be 
replaced  by  H  in  E,  we  mean  that  the  occurrence  of  G  derived  from  the  proof  of 
select(E,G),  is  to  be  replaced  by  H. 

N-step.  Let  P  be  an  F*  program  and  E,G,H  be  terms.  Let  selectp(E,G),  and  G=>pH.  Let 
El  be  the  result  of  replacing  G  by  H  in  E.  Then  we  say  that  E  reduces  to  El  in  an  N-step 
in  P.  The  qualification  "in  P"  is  omitted  when  P  is  clear  from  context.  The  prefix  N  in  N- 
step  is  intended  to  connote  normal  order. 

N-reduction.  Let  P  be  an  F*  program.  An  N-reduction  in  P  is  a  reduction  E1,E2,....  in  P 
such  that  for  each  i,  when  Ei  and  Ei+1  both  exist,  Ei  reduces  to  Ei+1  in  an  N-step  in  P.  In 
particular,  the  sequence  E  where  E  is  a  term,  is  an  N-reduction  in  P.  The  qualification  "in 
P"  is  omitted  when  P  is  clear  from  the  context. 

select-r.  This  reduction  strategy  repeatedly  uses  select  to  reduce  terms.  The  suffix  r 
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stands  for  recursive,  or  repeated.  Where  P  is  an  F*  program: 

select-rp(E,F)  if  selectp(E,F). 
select  rp(c(Tl,..,Ti,..,Tm)Jr)  if 

c  is  a  constmctor  symbol,  and 
select-rp(Ti,F). 

Again,  the  second  rule  is  a  schema,  so  that  an  instance  of  it  is  assumed  written  for  each  i, 
l=<i=<n.  Thus,  select-r  is  like  select  except  that  if  a  term  is  in  simplified  form,  it 
recursively  uses  select  on  one  of  the  arguments  of  the  outermost  constructor  symbol.  So, 
its  repeated  use  can  yield  normal-forms  of  terms. 

For  example,  with  the  usual  rules  for  append,  the  query  select([llappend([],[])],X)  fails, 
whereas  the  query  select-r([llappend([],[])],X)  succeeds  with  X=append(n,[])-  The 
subscript  on  select-r  is  dropped,  if  clear  from  context. 

NR-step.  Let  P  be  an  F*  program  and  E,G,H  be  terms.  Suppose  select-rp(E,G)  and 
G=>pH.  Let  El  be  the  result  of  replacing  G  by  H  in  E.  Then  we  say  that  E  reduces  to  El 
in  an  NR-step  in  P.  The  qualification  "in  P"  is  omitted  when  clear  from  context. 

NR-reduction.  Let  P  be  an  F*  program.  An  NR-reduction  in  P  is  a  reduction  E1,E2,.... 
in  P  such  that  for  each  i,  when  Ei  and  Ei+1  both  exist,  Ei  reduces  to  Ei+1  in  an  NR-step 
in  P.  In  particular,  the  sequence  E  where  E  is  a  term,  is  an  NR-reduction  in  P. 

NR-reductions  are  needed  to  compute  normal-forms  of  terms.  For  example,  the  term 
append([l],[2])  has  the  only  N-reduction  append([l],[2]),  [llappend([],[2])].  However,  it 
has  the  NR-reduction  append([l],[2]),  [llappend([],[2])],  [1,2].  The  qualification  "in  P"  is 
omitted  when  clear  from  context 

IL3  REDUCTION-COMPLETENESS  OF  F* 

Lemma  1.  Let  P  be  an  F*  program.  If  A->B  and  B  is  simplified  but  A  is  not,  then  A=>B. 
Proof.  Clear,  from  restriction  (b).  QED. 

Lemma  2.  Let  P  be  an  F*  program.  Let  Xl,..,Xn  be  variables,  G,H,tl,..,tn,tl*,..,tn*  be 
terms  such  that  for  each  i,  R(G,H,ti,ti*).  Let  c={<Xl,tl>,..,<Xn,tn>)  and 
T=(<Xl,tl*>,..,<Xn,tn*>}  be  substitutions.  Let  M  be  a  term,  possibly  containing 
variables,  but  only  from  {Xl,..,Xn}.  Then  R(G,H,Ma,Mx). 


Proof.  By  induction  on  length  of  M.  QED. 
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Lemma  3.  Let  P  be  an  F*  program.  If: 

(1)  G,  H,  El=f(tl,..,tn)  and  Fl=f(tl*,..,tn*)  are  terms,  and 

(2)  R(G,H,ti,ti*)  for  every  i  in  l,..,n,  and 

(3)  B=f(Ll,..,Ln)  is  the  head  of  some  rule  in  P,  and 

(4)  El=Ba  for  some  substitution  a,  which  defines  only  the  variables  in  B. 

Then  there  exists  a  substitution  x  such  that: 


(1) Fl=Bx,  and 

(2)  a  and  X  define  exactly  the  same  variables,  and 

(3)  If  pair  <X,s>  occurs  in  c,  and  <X,s*>  occurs  in  x,  then  R(G,H,s,s*). 

Proof.  Clear,  by  restrictions  (a)-(e).  QED. 

Lemma  4.  Let  P  be  an  F*  program.  If: 

(1)  f(tl,..,ti,..,tn)  is  a  term,  and 

(2)  f(Ll,..,Li-l,c(Xl,..,Xm),Li+l,..,Ln)=>RHS  is  a  rule  in  P,  and 

(3)  ti=dl,d2,d3,..,dr,  r>0,  is  an  N-reduction. 


Then,  f(tl,..,ti-l,dl,ti+l,..,tn),f(tl,..,ti-l,d2,ti+l,..,tn), ..,  f(tl,..,ti-l,dr,ti+l,..,tn)  is  also 


N-reduction. 


Proof.  By  definition  of  N-reduction.  QED. 

Theorem  1.  Let  P  be  an  F*  program.  Let  E1,F1,F2,G,H  be  terms  such  that 

(1)  R(G,H,E1,F1),  and 

(2)  FI  reduces  to  F2  in  an  N-step 

Then  there  is  an  N-reduction  E1,..,E2  in  P  such  that  R(G,H,E2,F2). 
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Proof.  It  is  helpful  to  draw  the  following  diagram: 
N-step 

FI - >F2 


|  R(G,H,E1,F1) 

I 

El - *>E2 

N-reduction 

We  have  to  show  that  R(G,H,E2,F2).  We  proceed  by  induction  on  length  of  El. 

Suppose  El  is  a  0-ary  function  symbol.  If  E1=F1  then  E1,F2  is  an  N-reduction  and 
R(G,H,F2,F2).  If  E1*F1  then  since  R(G,H,E1  ^Fl),  E1=G  and  E1=>F1.  Thus,  there  is  an 
N-reduction  E1,F1,F2  and  R(G,H,F2,F2).  In  both  cases,  take  E2=F2. 

Otherwise,  El=f(tl,..,tn),  n>0.  Assume  the  theorem  for  every  term  whose  length  is  less 
than  that  of  f(tl,..,tn).  If  E1=F1  then  E1,F2  is  an  N-reduction  and  R(GJH[,F2JF2). 
Otherwise  El ^Fl.  If  E  1=0  then  since  R(G,H,E1,F1),  E1=>F1.  Thus,  there  is  an  N- 
reduction  E1JF1.F2,  and  R(GJH,F2,F2).  Again,  in  both  cases,  take  E2=F2. 

We  now  arrive  at  the  interesting  cases,  with  E1*F1,  but  G*E1.  Hence  Fl=f(tl*,..,tn*) 
where  for  every  i,  R(G,H,ti,ti*).  We  now  consider  the  following  cases: 

Case  1.  F1=>F2.  Then  there  is  a  rule  f(Ll,..,Ln)=>RHS  in  P,  such  that  FI  matches 
f(Ll,..,Ln)  with  substitition  x,  and  F2=RHSx. 

Case  1-1.  El  matches  f(Ll,..JLn)  with  substitution  a.  By  Lemma  3,  there  exists 
substitution  p  such  that  Fl=f(Ll,..,Ln)P.  Since  Fl=f(Ll,..,Ln)x,  x=p. 

El=>RHSa,  so  let  E2=RHSct.  The  N-reduction  is  El  32.  Of  course  F2=RHSx. 
By  Lemma  3,  c  and  x  define  exactly  the  same  variables,  and  if  <X,s>  occurs  in  a 
and  <X,s*>  appears  in  x  then  R(G,H,s,s*).  Hence,  by  Lemma  2,  R(G,H,E2,F2). 

Case  1-2.  El  does  not  match  f(Ll,..,Ln).  Then,  since  El  is  ground  and  each 
variable  occurs  at  most  once  in  f(Ll,..,Ln),  there  is  some  Li  in  Ll,..,Ln,  and  some 
ti  in  tl,..,tn,  such  that  ti  does  not  match  Li.  Hence  Li  is  not  a  variable,  so 
Li=c(Xl,..,Xm),  c  a  constuctor  symbol  and  each  Xi  a  variable. 

Moreover,  since  R(G,H,ti,ti*),  and  ti  does  not  match  Li,  by  restriction  (c),  ti  is  not 
simplified.  Since  FI  matches  f(Ll,..,Ln),  ti*  matches  Li,  and  so  ti*  is  simplified. 
Since  R(G,H,ti,ti*),  ti=>ti*.  Thus  select(El,ti).  Hence  f(tl,..,ti,..,tn)  reduces  to 
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f(tl,..,ti*,..,tn)  in  an  N-step. 


Hence  there  exists  an  N-reduction  E1=P1,P2,P3,...  such  that  for  each  i, 
Pi=f(sl,..,sn),  and  for  each  sk  in  sl,..,sn,  sk=tk  or  sk=tk*.  Moreover,  Pi+1  is 
derived  from  Pi  by  selecting  some  sk  in  sl,..,sn  such  that  sk  does  not  match  Lk  in 
Ll,..,Ln,  and  replacing  sk,  in  Pi,  by  tk*.  We  also  have  for  each  i,  R(G,H,Pi,Fl). 
Since  n  is  finite,  this  reduction  cannot  be  infinite  and  must  end  in  Pm  such  that  Pm 
matches  f(Ll,..,Ln)  with  substitution  a.  Then  Pm=>RHSa.  Hence  we  have  the 
N-reduction  El,P2,P3,..,Pm,RHSa.  Take  E2=RHSa.  By  Lemma  3,  FI  and 
f(Ll,..,Ln)  match  with  some  substitution,  and  clearly  this  is  x.  Already, 

F2=RHSt.  By  Lemma  2,  R(G,H,E2,F2). 

Case  2.  Not  F1=>F2.  We  are  given  that  FI  reduces  to  F2  by  an  N-step.  We  now  have  to 
show  that  there  is  an  N-reduction  E1,..,E2  such  that  R(G,H,E2,F2). 

Suppose  select(Fl,u).  Then  u  occurs  in  some  ti*.  That  is,  there  is  some  ti*  in  tl*,..,tn*, 
such  that  select(ti*,u).  Let  u=>v  and  let  ti**  be  the  result  of  replacing  u  in  ti*  by  v. 

Hence  ti*  reduces  to  ti**  in  an  N-step,  and  also  F2=f(tl*,..,ti**,..,tn*).  By  definition  of 
select,  there  is  a  rule  f(Ll,..,Li,..,Ln)=>RHS  in  P  such  that  ti*  does  not  match  Li.  Hence 
Li=c(Xl,..,Xm),  m>=0,  where  c  is  a  constructor  symbol  and  each  Xi  is  a  variable. 

Clearly,  ti*  is  not  simplified.  So,  by  restriction  (b)  ti  is  also  not  simplified,  ti*  reduces  to 
ti**  in  an  N-step.  We  already  have  R(G,H,ti,ti*).  Since  the  length  of  ti  is  less  than  that 
of  f(tl,..,ti,..,tn),  by  induction  hypothesis  there  is  an  N-reduction  ti=dl,d2,..,dr,  r>=l,  such 
that  R(GJH,dr,ti**).  By  Lemma  4,  the  sequence  f(tl,..,ti-l,ti,ti+l,..,tn),  f(tl,..,ti- 
l,d2,ti+l,..,tn),..,  f(tl,..,ti-l,dr,ti+l..,tn)  is  an  N-reduction.  Take  E2=f(tl,..,ti- 
l,dr,ti+l..,tn).  We  already  have  F2=f(tl*,..,ti**,..,tn*)  and  for  each  k,  R(G,H,tk,tk*). 
Hence  R(G,H,E2,F2).  QED. 

Lemma  5.  Let  P  be  an  F*  program.  Let  R(GJH,E0,F0)  and  F0,Fl,..,Fn  be  an  N-reduction. 
Then  there  is  an  N-reduction  E0,..,El,..,En  such  that  R(G,H,En,Fn). 

Proof.  By  induction  on  length  n  of  F0,Fl,..,Fn.  If  n=0  then  clear.  Otherwise  assume 
lemma  for  the  N-reduction  Fl,..,Fn.  Since  F0  reduces  to  FI  in  an  N-step  and 
R(G,H,E0,F0),  by  Theorem  1,  there  exists  an  N-reduction  E0,..,E1  such  that 
R(G,H,E1,F1).  By  induction  hypothesis,  there  exists  an  N-reduction  El,..,En,  such  that 
R(G,H,En,Fn).  Hence  there  exists  the  N-reduction  E0,..,El,..,En  such  that  R(G,H,En,Fn). 
QED. 

Theorem  2.  Reduction-completeness  of  F*  for  simplified  forms.  Let  P  be  an  F* 
program  and  DO  a  term.  Let  D0,Dl,..,Dn,  n>=0,  be  a  successful  reduction  in  P.  Then 
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there  is  a  successful  N-reduction  DO, El,.. 3m  in  P,  such  thatEm-*>Dn. 

Proof.  By  induction  on  length  n  of  D0,Dl,..,Dn.  If  n=0,  DO  is  already  simplified,  so  DO  is 
a  successful  N-reduction,  and  D0-*>D0. 

Let  n>0  and  assume  Theorem  for  Dl,..,Dn.  Then  there  is  a  successful  N-reduction 
Dl,F2,..,Fp  such  that  Fp-*>Dn.  The  situation  can  be  laid  out  as  follows: 

Dn 


D2 

I 

Dl->F2-*>Fp 

I 

|  R (G, H, Em, Fp) ,  Fp-*>Dn 

I 

D0->El-*>Em 

Since  D0->D1,  there  are  terms  G,H,  such  that  G=>H  and  D1=D0[G/H].  Hence 
R(G,H,D0,D1).  Since  Dl,F2,..JFp  is  a  successful  N-reduction,  by  Lemma  5,  there  is  an 
N-reduction  D0,El,..,Eq  such  that  R(G,H,Eq,Fp).  If  Eq  is  simplified,  take  Em=Eq.  Now 
D0,El,..,Em  is  a  successful  N-reduction.  Since  R(G,H,Em,Fp),  and  Fp-*>Dn,  Em-*>Dn, 
as  required. 

If  Eq  is  not  simplified,  then  since  R(G,H,Eq,Fp),  and  Fp  is  simplified,  Eq=>Fp.  Now  take 
Em=Fp,  so  D0,El,..,Eq,Em  is  a  successful  N-reduction,  and  Em-*>Dn,  as  required. 

QED. 

Theorem  3.  Let  P  be  an  F*  program.  Let  E1,F1,F2,G,H  be  terms  such  that 

(1)  R(G,H,E1,F1),  and 

(2)  FI  reduces  to  F2  in  an  NR-step 

Then  there  is  an  NR-reduction  E1,..,E2  in  P  such  that  R(G,H,E2,F2). 

Proof.  By  induction  on  length  of  El.  Let  El  be  a  0-ary  function  symbol.  If  E1=F1  then 
clear.  If  E1*F1,  then  E1=G,  and  so,  clear.  Otherwise,  let  El=f(tl,..,tn),  n>0.  Assume 
theorem  for  tl,..,tn. 

Case  1.  El  is  unsimplified.  If  FI  is  simplified,  then  since  R(G,H,E1,F1),  E1=G,  so  the 
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theorem  is  clear.  If  FI  is  unsimplified,  then  by  definition  of  NR-reduction,  FI  reduces  to 
F2  in  an  N-step.  By  Theorem  1,  there  exists  an  N-reduction  El  ,..,E2  such  that 
R(G,H,E2,F2).  But  this  is  also  an  NR-reduction. 

Case  2.  El  is  simplified.  Then,  since  R(G,H,E1,F1),  FI  is  also  simplified.  Let 
Fl=f(sl,..,sn)  where  f  is  a  constructor  symbol.  Hence  for  each  i,  l=<i=<n,  R(G,H,ti,si). 
Since  FI  reduces  to  F2  in  an  NR-step,  there  is  some  si  in  sl,..,sn,  such  that  si  reduces  to 
some  si*  in  an  NR-step  and  F2=f(sl,..,si*,..,sn).  By  induction  hypothesis,  there  exists  an 
NR-reduction  ti=til,ti2,..,tik  such  that  R(G,H,tik,si*).  It  can  easily  be  shown  that  the 
reduction  f(tl,..,til,..,tn),  f(tl,..,ti2,..,tn), ..,  f(tl,..,tik,..,tn)  is  also  an  NR-reduction.  Clearly 
R(GJH,f(tl,..,tik,..,tn),F2).  QED. 

Lemma  6.  Let  P  be  an  F*  program.  Let  R(G,H,E0,F0)  and  F0,..JFn,  n>=0,  be  an  NR- 
reduction.  Then  there  is  an  NR-reduction  E0,..,Ek  such  that  R(G,H,Ek,Fn). 

Proof.  Similar  to  that  of  Lemma  5.  QED. 

Theorem  4.  Reduction-completeness  of  F*  for  normal  forms.  Let  P  be  an  F*  program 
and  DO  a  term.  Let  D0,Dl,..,Dn,  n>=0,  be  a  reduction  in  P,  where  Dn  is  in  normal  form. 
Then  there  is  an  NR-reduction  D0,El,..,Em=Dn,  m>=0,  in  P. 

Proof.  By  induction  on  length  n  of  D0,Dl,..,Dn.  If  n=0,  DO  is  already  in  normal  form,  so 
DO  is  the  required  NR-reduction. 

Let  n>0  and  assume  theorem  for  Dl,..,Dn.  Then  there  is  an  NR-reduction 
D1  ,F2,..,Fp=Dn.  The  situation  can  be  laid  out  as  follows: 

Dn  (in  normal  form) 


D2 

I 

D1->F2- *>Fp=Dn 
! 

D0->Sl-*>Em 

Since  D0->D1  there  are  terms  G>H,  such  that  G=>H  and  D1=D0[G/H].  Hence 
R(G,H,D0,D1).  Since  Dl,F2,..,Fp  is  an  NR-reduction,  by  Lemma  6,  there  is  an  NR- 
reduction  D0,El,..,Eq  such  that  R(G,H,Eq,Fp).  It  is  easily  shown,  by  induction  on  length 
of  terms,  that  there  is  an  NR-reduction  Eq,..,Fp.  In  each  step  in  it,  an  occurrence  of  G  is 
replaced  by  H.  The  required  NR-reduction  is  then  D0,El,..,Eq,.. ,Fp=Dn=Em.  QED. 
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m.l  INTRODUCTION 

A  class  of  F*  programs  called  Deterministic  F*  (DF*)  is  now  defined  and  shown  to 
possess  several  useful  computational  properties.  In  particular,  every  DF*  program 
satisfies  confluence  and  directedness.  Confluence  is  shown  to  hold  for  any  F*  program 
which  satisfies  just  restriction  (f)  below. 

Confluence,  means  that  if  for  terms  M,N,P,  M-*>N,  and  M-*>P,  then  there  exists  term  Q 
such  that  N-*>Q,  and  P-*>Q.  It  has  the  immediate  consequence  that  every  term  has  at 
most  one  normal  form.  Hence  DF*  can  be  used  as  a  functional  programming  system. 
Also,  if  a  DF*  program  is  interpreted  as  an  equality  theory,  equality  of  two  terms  can  be 
determined  by  checking  whether  their  normal  forms  are  syntactically  identical. 

Directedness,  for  simplified  forms,  means  that  if  a  term  has  a  simplified  form  then  any 
N-reduction  starting  at  that  term,  if  extended  far  enough,  computes  it.  Moreover,  all 
successful  N-reductions  are  of  equal  length.  Directedness,  for  normal  forms,  means  that 
if  a  term  has  a  normal  form,  then  any  NR-reduction  starting  at  that  term,  if  extended  far 
enough,  computes  it.  Moreover,  all  NR-reductions  ending  in  normal  forms  are  of  equal 
length.  Due  to  directedness,  no  searching  among  alternative  N-  or  NR-reductions  is 
necessary. 

m.2  DEFINITION  OF  DF* 

A  DF*  program  is  an  F*  program  P  satisfying  two  restrictions: 

(f)  Let  LHS 1  and  LHS2  be  variants  of  heads  of  two  rules  in  P,  such  that  LHS 1 
and  LHS2  have  no  variables  in  common.  Then  LHS1  and  LHS2  do  not  unify. 

(g)  Let  f(Ll,- .,Li,..,Lm)=>RHS  be  a  rule  in  P,  where  Li  is  not  a  variable.  Then  in 
every  other  rule  f(Kl,..,Ki,..,Km)=>RHSl  in  P,  Ki  is  not  a  variable. 

Again,  as  can  be  seen  from  examples  in  this  paper,  and  especially  in  Section  VI,  DF*  is 
also  quite  expressive.  Note  that  restrictions  (a)-(e)  are  upon  rules  while  (f)  and  (g)  are 
upon  the  entire  program.  In  the  following,  the  first  three  rules  do  not  constitute  a  DF* 
program  because  they  violate  (f)  and  the  next  four  do  not  because  they  violate  (g): 

insert(A,[])=>[A]. 

insert(A,[UIV])=>[A,UIV]. 

insert(A,[UIV])=>[Ulinsert(A,V)]. 
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f(X, [])=>[]. 
f([],[UIV])=>[]. 
a=>a. 
b=>[]. 

Restriction  (f),  supported  by  (a)-(e),  ensures  confluence.  Restriction  (g),  supported  by 
(a)-(f),  ensures  directedness.  In  particular,  given  a  term  f(tl,..,ti,..,tn),  it  is  possible  to 
determine,  at  compile  time,  whether  ti  needs  to  be  simplified.  It  needs  to  be,  only  if  the 
ith  argument  in  the  head  of  any  F*  rule  defining  f  is  a  non-variable.  Moreover,  as  shown 
below,  the  arguments  of  f  which  do  need  to  be  simplified  can  be  simplified  in  any  order. 

The  importance  of  select  may  be  emphasized  from  the  observation  that  even  with 
restrictions  (a)-(g),  every  outermost  reduction  strategy  is  not  reduction  complete.  For 
example,  given  the  DF*  program: 

g([]>X)=>[] 
g([UIV],W)=>.  .  . 


and  the  term  E=g(b,a),  a  rightmost-outermost  reduction  strategy  would  compute  the 
infinite  reduction  g(b,a),g(b,a),....  However,  there  exists  a  successful  leftmost-outermost 
reduction  g(b,a),g([],a),[].  Select,  of  course,  would  compute  this  second  reduction. 

Thus,  select  is  more  than  an  outermost  reduction  strategy.  For  DF*,  it  may  perhaps  be 
called  outermost-call-by-need.  Note  that  it  is  still  non-deterministic.  However  due  to 
directedness,  the  non-determinism  is  benign. 

Sp(A,B)-  Let  P  be  an  F*  program,  and  A,B  be  terms.  Let  Gl,..,Gm,  m>=0,  be  mutually 
non-overlapping  subterms  in  A,  and  Hl,..,Hm  be  terms  such  that  for  each  i=<m,  Gi=>Hi, 
and  B  is  the  result  of  simultaneously  replacing  Gl,..,Gm,  in  A,  by  respectively,  Hl,..,Hm. 
Then  we  say  Sp(A,B).  Note  that  Gl,..,Gm  need  not  include  all,  or  even  one,  of  the 
mutually  non-overlapping  subterms  of  A  which  reduce  as  a  whole.  The  subscript  on  S  is 
dropped  if  clear  from  context. 

R<©i.  Let  R  be  an  N-reduction  E0,El,..,Em,  m>=0,  where  for  no  i,  Ei=>Ei+l.  Then 
there  is  some  function  symbol  f,  such  that  each  Ei  is  of  the  form  f(pl,..,pk).  Let 
EO=f(tl,..,tk),  and  for  any  p,  let  Rp  be  E0,El,..,Ep. 

For  any  l=<i=<k,  R0@i  is  defined  to  be  the  singleton  sequence  ti.  For  any  j*m,  let 
Ej=f(al,..,an-l,an,an+l,..,ak),  and  Ej+l=f(al,..,an-l,bn,an+l,..,ak)  such  that  an  reduces  to 
bn  in  an  N-step.  If  n=i,  then  Rj+l@i=Rj@i:bn,  otherwise,  Rj+l@i=Rj@i.  Here  :  is 
concatenation  of  a  term  at  the  end  of  a  sequence  of  terms. 
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For  example,  with  the  rules  a=>a,  b=>bl,  bl=>b2,  and  the  N-reduction  R=f(a,b),f(a,bl), 
f(a,bl),f(a,b2),  R@l=a,a,  and  R@2=b,bl,b2.  Thus,  roughly,  R@i  is  the  sequence, 
without  duplicates,  of  ith  arguments  of  the  outermost  function  symbol  of  the  members  of 
the  N-reduction  R. 

R@u.  R@u  is  a  generalization  of  R@i  to  positions  in  terms.  Let  R  be  an  NR-reduction 
E0,El,..,Em,  m>=0,  where  EO  is  simplified.  Let  A0,Al,..,Ap  be  unsimplified  terms  in  EO 
such  that  no  Ai  is  properly  contained  in  any  other  unsimplified  term.  Let  the  positions  of 
A0,Al,..,Ap,  in  EO  be  ul,..,up  respectively.  Then,  for  each  j^m,  Ej+1  can  be  thought  of 
as  being  derived  from  Ej  by  replacing  a  term  A  at  one  of  the  ui,  by  another  term  B. 
Moreover,  A  reduces  to  B  in  an  NR-step. 

For  any  ui  in  ul,..,up,  R0@ui  is  defined  to  be  the  singleton  sequence  Ai.  For  any  j,  j*m, 
let  Ej+1  be  derived  from  Ej  by  replacing  a  term  P  at  position  u  in  Ej  by  Q,  where  u  is  in 
ul,..,up.  If  u=ui,  Rj+l@ui=Rj@ui:Q,  otherwise,  Rj+l@ui=Rj@ui. 

m.3  CONFLUENCE  AND  DIRECTEDNESS  OF  DF* 

Confluence  and  directedness  are  shown  by  deriving  the  following  results,  for  any  DF* 
program  P: 

(a)  Let  F1,E1,F2  be  terms  such  that  S(F1,E1),  and  F1->F2.  Then  there  exists  a 
term  E2  such  that  E1-*>E2,  and  S(F2,E2). 

(b)  If  there  are  two  N-reductions  starting  at  the  same  term  and  ending  in  terms  in 
simplified  form,  then  these  terms  are  identical,  and  the  N-reductions  are  of  equal 
length. 

(c)  Let  E0,El,..,En  be  a  successful  N-reduction.  Let  E0=F0,Fl,..,Fp,  be  an 
unsuccessful  N-reduction,  i.e.  Fp  is  not  simplified.  Then  p<n,  and  there  exists 
Fp+1  such  that  Fp  reduces  to  Fp+1  in  an  N-step. 

Now,  (a)  is  iterated  to  obtain  confluence,  (c)  requires  (b).  From  (c)  we  infer  that  if  a  term 
EO  has  a  successful  N-reduction  then  no  N-reduction  starting  at  EO  is  infinite,  or 
terminates  in  failure.  Hence,  every  N-reduction  must  terminate  in  a  term  in  simplified 
form.  Hence  directedness  for  simplified  forms.  Similarly,  directedness  for  normal  forms. 

Lemma  1.  Select  never  chooses  overlapping  terms.  Let  P  be  a  DF*  program.  Let  E 
and  F  be  terms  such  that  select(E,F).  Then,  for  all  G,  select(E,G)  implies  that  G  is  not 
properly  contained  in  F. 
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Proof.  By  induction  on  length  of  E.  As  before,  the  definition  of  select,  for  any  term 
f(Tl,..,Tn)  is: 

selectp(f(Tl,..,Tn),f(Tl,„,Tn))  if  f(Tl,..,Tn)=>pX. 
selectp(f(Tl,..,Ti,..,Tn)^C)  if 

there  is  a  rule  f(Ll,..,Li,..,Ln)=>RHS  in  P,  and 
there  is  no  substitution  a  such  that  Ti=Lic,  and 
selectp(Ti,X). 

If  E  is  a  O-ary  function  symbol,  the  lemma  holds.  Otherwise  let  E=f(tl,..,ti,..,tm)  and  let 
the  lemma  hold  for  each  of  tl,..,tm.  Suppose  select(E,F)  but  F*E.  Then  for  some  ti  in 
tl,..,tm,  select(ti,F).  Suppose  select(E,G).  If  G=E  then  G  is  clearly  not  contained  in  F. 
Otherwise,  for  some  tj  in  tl,..,tm,  select(tj,G).  If  j=i,  by  induction  hypothesis,  G  is  not 
properly  contained  in  F.  If  j*i,  of  course,  G  is  not  properly  contained  in  F. 

Suppose  select(E,F)  and  F=E.  Then  there  exists  a  rule  f(Ml,..,Mi,..,Mm)=>RHS  such 
that  E  matches  its  head.  Now  suppose  that  there  also  exists  G  such  that  select(E,G),  and 
G  is  properly  contained  in  F.  Hence,  for  some  ti  in  tl,..,tm,  select(ti.G).  Hence  there  is  a 
rule  f(Ll,..,Li,..,Lm)=>RHSl  such  that  ti  does  not  match  Li.  Hence  Li  is  not  a  variable. 
By  restriction  (g)  Mi  is  also  not  a  variable.  Hence  ti  is  in  simplified  form.  But  then 
select(ti,G)  fails.  Contradiction.  QED. 

Lemma  2.  Let  P  be  a  DF*  program.  Let  G  be  a  term.  Then  there  is  at  most  one  term  H 
such  that  G=>H. 

Proof.  By  restriction  (f).  QED. 

Lemma  3.  Let  o  and  x  be  two  substitutions  each  defining  only  the  variables  Xl,..,Xm, 
m>=0,  such  that  for  any  i=<m,  where  <Xi,si>  appears  in  a,  and  <Xi,ti>  in  x,  S(si,ti). 
Let  M  be  a  term,  possibly  containing  variables,  but  only  from  Xl,..,Xm.  Then 
S(Mo,Mx). 

Proof.  By  induction  on  length  of  M.  QED. 

Lemma  4.  Let  P  be  a  DF*  program.  Let  A=f(tl,..,tm),  m>0,  and  B=f(tl*,..,tm*),  such 
that  for  each  i=<m,  S(ti,ti*).  Let  A=>C.  Then  there  exists  D,  such  that  B=>D  and  S(CJD). 

Proof.  Let  A  reduce  to  C  by  the  rule  LHS=>RHS.  Then,  there  exists  a  substitution  a  such 
that  A=LHSa  and  C=RHSa.  It  is  easily  verified  that  there  exists  substitution  x  such  that 
B=LHSx,  o  and  x  define  the  same  variables,  and  for  each  i,  where  <Xi,pi>  appears  in  a 
and  <Xi,qi>  in  x,  S(pi,qi).  Hence  B=>RHSx=D.  By  Lemma  3,  S(C,D).  QED. 
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Lemma  5.  Let  P  be  a  DF*  program.  Let  FI 31, F2  be  terms  such  that  S(F1,E1),  and  Fl- 
>F2.  Then  there  exists  term  E2  such  that  E1-*>E2,  and  S(F2,E2). 

Proof.  By  induction  on  length  of  FI.  The  situation  can  be  visualized  in  the  following 
diagram: 


f: 

/  \ 

*  \ 

/  \ 

El  F2  S(F1,E1),  F1->F2 

\  / 

★  ★ 

\  / 

E2 

Casel.  FI  is  a  0-ary  function  symbol.  Since  F1->F2,  F1=>F2.  IfFl=El,  take 
E2=F2.  Then  El  =>E2,  and  S(F2,E2),  as  required.  Otherwise,  FI  =>E1.  By  restriction 
(f),  E1=F2.  Take  E2=F2.  Again,  E1-*>E2,  and  S(F2,E2),  as  required. 


Case  2.  Fl=f(tl,..,tm),  m>0.  Assume  Lemma  for  tl,..,tm. 

Case  2-1.  FI  reduces  to  El  as  a  whole.  If  FI  reduces  to  F2  as  a  whole,  by 
restriction  (f),  F2=E1.  TakeE2=El.  Then  E1-*>E2,  and  also  S(F2,E2),  as 
required.  Otherwise,  there  is  some  j=<m,  such  that  tj->tj*,  and 
F2=f(tl,..,tj*,..,tm).  Hence  S(F1,F2).  By  Lemma  4,  there  exists  E2  such  that 
F2=>E2  and  S(E1,E2).  But  then  E1-*>E2.  Since  F2=>E2,  S(F2,E2),  as  required. 

Case  2-2.  FI  does  not  reduce  as  a  whole  to  El.  Then  El=f(sl,..,sm),  and  for 
each  i=<m,  S(ti.si). 

If  F1=>F2,  then,  by  Lemma  4,  there  exists  E2  such  that  E1=>E2,  and  S(F2,E2),  as 
required.  Otherwise,  there  is  some  j=<m,  such  that  tj->tj*,  and  F2=f(tl,..,tj- 
l»tj*,tj+l,..,tm).  By  induction  hypothesis,  there  exists  p,  such  that  sj-*>p  and 
S(tj*,p).  Take  E2=f(sl,..,sj-l,p,sj+l,..,sm).  Clearly,  E1-*>E2,  and  also,  S(F2^2), 
as  required.  QED. 

Lemma  6.  Let  P  be  a  DF*  program,  and  M,N,P  be  terms  such  that  S(M,N)  and  M-*>P. 
Then  there  exists  term  Q  such  that  N-*>Q  and  S(P,Q). 
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Proof.  By  iterating  Lemma  5.  QED. 

Lemma  7.  Let  P  be  a  DF*  program,  and  M,N,P  be  terms  such  that  M->N,  and  M-*>P. 
Then  there  exists  term  Q  such  that  N-*>Q,  and  P-*>Q. 

Proof.  Since  M->N,  S(M,N).  By  Lemma  6,  there  exists  term  Q  such  that  N-*>Q  and 
S(P,Q).  Hence  P-*>Q,  as  required.  QED. 

Theorem  1.  Confluence  of  DF*.  Let  P  be  a  DF*  program,  and  M,N,P  be  terms  such  that 
M-*>N,  and  M-*>P.  Then  there  exists  term  Q  such  that  N-*>Q,  and  P-*>Q. 

Proof.  By  iterating  Lemma  7.  QED. 

Corollary.  Uniqueness  of  normal  forms.  Let  P  be  a  DF*  program.  Then  every  term  has 
at  most  one  normal  form. 

Lemma  8.  Let  R  be  an  N-reduction  f(pl ,..,pk)=EOvE  1  ,..,Em  such  that  for  no  i,  Ei=>Ei+l. 
Then  the  length  m  of  E0,E1  ,..,Em  is  equal  to  the  sum  of  the  lengths  of  R@  1 , 
R@2,..,R@k. 

Proof.  By  induction  on  m.  If  m=0,  then  clear.  Assume  lemma  for  EO,.., Em- 1.  There 
exists  exactly  one  n,  such  that  Em-l=f(tl,..,tn-l,tn,tn+l,..,tk),  Em=f(tl,..,tn- 
l,un,tn+l,..,tk),  and  tn  reduces  to  un  in  an  N-step.  So,  for  every  i,  i*n,  (E0,..,Em- 
l)@i=(EO,..,Em-l^m)@i.  Only  for  n,  (EO,..,Em-l,EmX®n=(EO,..JBm-l)@n:un.  By 
induction  hypothesis,  the  lemma  is  clear.  QED. 

Lemma  9.  Let  P  be  a  DF*  program.  Let  EO,El,..,En  be  a  successful  N-reduction.  Then 
for  every  successful  N-reduction  E0JFl,..,Fp,  p=n  and  Fp=En. 

Proof:  By  induction  on  n.  If  n=0  then  EO  is  simplified  and  the  lemma  holds  trivially.  Let 
n>l.  Assume  hypothesis  for  all  successful  N-reductions  of  length  less  than  n. 

Since  E0JEl,..,En  is  a  successful  N-reduction,  there  exists  Ek,  0=<k<n,  such  that  for  no  i, 
0=<i<k,  Ei=>Ei+l,  but  Ek=>Ek+l,  and  Ek+l,..,En  is  a  successful  N-reduction.  Let 
E0=f(tl,..,tm),  m>=0.  Since  n>0,  f  is  not  a  constructor  symbol.  Then  Ek=f(sl,..,sm)  for 
terms  sl,..,sm.  Similarly,  for  the  successful  N-reduction  FO,Fl,..,Fp,  there  exists  Fj  with 
properties  similar  to  those  of  Ek.  The  situation  can  be  laid  out  in  the  following  diagram: 

E0=f (t 1 , . ,,tm) - *>Ek=f (si, . . , sm) =>Ek+l--*>En 


F0=f(tl, ..,tm) - *>F j=f (ql , ..,qm) =>F j+1 — *>Fp 
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Consider  any  ti  in  tl,..,tm. 

Case  1.  ti  is  simplified.  Then,  by  definition  of  select,  ti=si.  Similarly,  ti=qi.  Hence  qi=si. 

Case  2.  ti  is  unsimplified.  If  si  is  simplified,  then  there  exists  a  successful  N-reduction 
(E0,..,Ek)@i,  of  length  less  than  n. 

By  restriction  (g),  the  ith  argument  in  the  head  of  any  rule  in  P,  defining  f,  is  a  non¬ 
variable.  Hence,  qi  is  also  simplified.  Hence  there  exists  a  successful  N-reduction 
(FO,..  JFj)@i.  By  induction  hypothesis,  its  length  is  equal  to  that  of  (E0,..,Ek)@i,  and 
qi=si. 


If  si  is  unsimplified,  then,  by  restriction  (g),  and  definition  of  select,  ti=si=qi. 

Hence  Ek=Fj.  By  Lemma  8,  the  length  of  EO,..,Ek  is  the  sum  of  lengths  of  (EO,..,Ek)@i 
such  that  ti  is  unsimplified  but  si  is  simplified.  Again,  by  Lemma  8,  and  Case  2,  this  is 
also  the  length  of  F0,..,Fj.  Hence,  k=j.  By  restriction  (0,  Ek+l=Fj+l.  Now,  Ek+l,..JEn, 
is  a  successful  N-reduction  of  length  less  than  n.  By  induction  hypothesis,  its  length  is 
equal  to  that  of  the  successful  N-reduction  Fj+l,..,Fp,  and  En=Fp.  Hence,  also,  the  length 
of  E0,..,En  is  equal  to  that  of  F0,..,Fp.  QED. 

Lemma  10.  Let  P  be  a  DF*  program.  Let  E0,El,..,En  and  E0=F0FL-Fm  be  two  NR- 
reductions  such  that  En  and  Fm  are  in  normal  form.  Then  En=Fm  and  n=m. 

Proof:  Note  that  En=Fm  follows  directly  from  the  confluence  of  DF*.  We  focus  on 
showing  that  n=m,  and  proceed  by  induction  on  length  of  E0,El,..,En.  If  n=0,  then  clear. 
Otherwise,  let  n>0  and  assume  the  lemma  for  NR -reductions  of  length  less  than  n. 

Case  1.  E0  is  unsimplified.  Then,  there  exists  Ek,  0<k=<n  such  that  E0,..,Ek  is  a 
successful  N-reduction.  Also,  there  exists  Fj,  0<j=<m  such  that  E0,..,Fj  is  a  successful 
N-reduction.  By  Lemma  9,  Fj=Ek  and  j=k.  Now  Ek,..,En,  and  Fj,..,Fm  are  also 
NR-reductions.  The  length  of  Ek,..,En  is  less  than  n,  so  by  induction  hypothesis,  n=m. 

Case  2.  E0  is  simplified.  Then  E0=c(tl,..,tq)  for  terms  tl,..,tq  and  constructor  symbol  c. 

If  E0  is  in  normal  form,  then  the  theorem  holds.  Otherwise,  let  A0,Al,..,Ap  be 
unsimplified  terms  in  E0  such  that  no  Ai  is  properly  contained  in  any  unsimplified  term. 
Consider  any  Ai  in  A0,Al,..,Ap. 

Let  the  position  at  which  Ai  occurs  in  E0  be  ui.  Then,  since  En  is  in  normal  form, 
(E0,..,EnX§>ui  is  an  NR-reduction  ending  in  a  normal  form.  Similarly  obtain 
(F0,..,Fm)@ui. 
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By  reasoning  as  in  Case  1,  the  length  of  (EO,..,En)@ui  is  equal  to  that  of  (F0,..,Fm)@ui. 
It  can  be  shown,  analogously  to  Lemma  8,  that  the  length  of  EO,..,En  is  equal  to  the  sum 
of  the  lengths  of  each  (E0,..,EnXa>ui.  Similarly,  for  length  of  FO,..,Fm.  Hence  m=n. 
QED. 

Lemma  11.  Let  P  be  a  DF*  program  and  EO  a  term.  Let  E0,El,..,En  be  a  successful  N- 
reduction.  Let  E0=F0,Fl,..,Fp,  be  an  unsuccessful  N-reduction,  i.e.  Fp  is  not  simplified. 
Then  p<n,  and  there  exists  Fp+1  such  that  Fp  reduces  to  Fp+1  in  an  N-step. 

Proof:  By  induction  on  the  length  of  E0,..,En.  If  n=0  then  EO  is  in  simplified  form  and 
the  only  N-reduction  starting  at  EO  is  EO  itself,  so  the  lemma  is  clear.  Let  n>0.  Then  EO 
is  not  simplified.  Assume  lemma  for  all  successful  N-reductions  of  length  less  than  n. 
Since  EOJEl,..,En  is  a  successful  N-reduction,  there  exists  Ek,  0=<k<n  such  that  for  no  i, 
0=<i<k,  Ei=>Ei+l,  but  Ek=>Ek+l,  and  Ek+l,..,En  is  a  successful  N-reduction. 
Let  EO=f(tl,..,tm),  m>=0.  Let  Ek=f(sl,..,sm).  We  have  two  cases: 

Case  1.  There  exists  Fj,  0=<j<p  such  that  for  no  i,  0=<i<j,  Fi=>Fi+l,  but 
Fj=>Fj+l,  and  Fj+l,..JFp  is  an  N-reduction.  Let  Fj=f(ql,..,qm).  The  situation  can  be 
visualized  in  the  following  diagram: 

E0=f (tl, . . , tm) — *  >Ek=f (si, . . , sm) =>Ek+l  —  *>En 

F0  =  f  \t 1 ,  . . , tm) — *>F j  =  f (ql,  . . , qm) =>F j  +  1  —  *>Fp 

By  reasoning  as  in  Lemma  9  above,  Ek=Fj,  and  k=j.  By  restriction  (0,  Ek+l=Fj+l.  By 
induction  hypothesis,  p<n.  Furthermore,  there  exists  Fp+1  such  that  Fp  reduces  to  Fp+1 
in  an  N-step. 

Case  2.  There  does  not  exist  Fj  in  F0,..,Fp  such  that  Fj=>Fj+l.  Let  Fp=f(ql,..,qm).  The 
situation  can  be  visualized  as: 

E0=f (tl, . . , tm) — *>Ek=f (si, . . , sm) =>Ek+l — *>En 

F0  =  f (tl, . . , tm) — *  >Fp=f (ql, . . , qm)  (unsimplified) 

It  is  easily  seen  that  either  Fp=>Fp+l,  or  there  exists  i  in  l,..,m  such  that  qi  is  not 
simplified,  but  si  is.  By  induction  hypothesis,  there  exists  ri  such  that  qi  reduces  to  ri  in 
an  N-step.  Hence  Fp  reduces  to  f(ql,..,ri,..,qm)  in  an  N-step.  Also,  by  Lemma  8,  and 
induction  hypothesis,  pen.  QED. 

Theorem  2.  Directedness  for  simplified  forms.  Let  P  be  a  DF*  program.  Let  EO 
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be  a  term  and  let  EO,..,En  be  a  successful  reduction.  Then  any  N-reduction  starting  at 
EO,  if  extended  far  enough,  would  terminate  in  a  term  in  simplified  form. 

Proof.  By  reduction-completeness  for  simplified  forms,  (Theorem  2,  Section  II),  and 
iterating  Lemma  11.  QED. 

Lemma  12.  Let  P  be  a  DF*  program  and  EO  a  term.  Let  E0JEl,..,En  be  an  NR -reduction 
such  that  En  is  in  normal  form.  Let  E0=F0,F1,..  ,Fp,  be  an  NR-reduction  such  that  Fp  is 
not  in  normal  form.  Then,  p<n,  and  there  exists  Fp+1  such  that  Fp  reduces  to  Fp+1  in  an 
NR-step. 

Proof:  By  induction  on  length  of  E0,El,..,En.  If  n=0,  then  clear.  Otherwise,  assume 
Lemma  for  all  NR-reductions  of  length  less  than  n,  and  ending  in  normal  forms. 

Case  1.  EO  is  unsimplified.  By  a  reasoning  very  similar  to  that  in  proof  of  Lemma  11. 

Case  2.  EO  is  simplified.  If  EO  is  in  normal  form,  the  lemma  trivially  holds.  Otherwise, 
as  in  Case  2  of  Lemma  10.  QED. 

Theorem  3.  Directedness  for  normal  forms.  Let  P  be  a  DF*  program.  Let  EO  be  a 
term  and  let  E0,..,En  be  a  reduction  where  En  is  in  normal  form.  Then  any  NR-reduction 
starting  at  EO,  if  extended  far  enough,  would  terminate  in  a  normal  form. 

Proof  By  reduction-completeness  for  normal  forms,  (Theorem  4,  Section  II),  and 
iterating  Lemma  12.  QED. 
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IV.  1  INTRODUCTION 

Intuitively,  it  can  be  seen  that  in  an  N-reduction  a  term  is  reduced  only  when  it  is 
necessary  for  simplifying  the  first  term  in  the  reduction.  In  this  sense,  an  N-reduction 
conserves  computation.  For  example,  with  the  rules: 

f(X)=>[X]. 

a=>[]. 


there  exists  the  N-reduction  f(a),[a].  There  is  no  N-reduction  starting  at  f(a)  in  which  a  is 
reduced.  However,  it  can  also  happen  that  in  an  N-reduction,  several  copies  of  the  same 
term  are  reduced.  This  can  happen  when  use  is  made  of  a  rule  in  which  a  variable  occurs 
more  than  once  on  the  right  hand  side.  In  this  sense,  an  N-reduction  wastes  computation. 
For  example,  with  the  rules: 

f(X)=>g(X,X). 

g([],  [])=>[]• 
a=>[]. 


there  exists  the  N-reduction  f(a),g(a,a),g([],a),g([], []),[].  The  two  occurrences  of  a  in  the 
second  term  are  copies  of  each  other,  yet  they  are  reduced  separately.  This  is  the  same 
problem  which  arises  with  a  call-by-name  procedure  call  mechanism  in  programming 
languages. 

If  we  can  arrange  that  when  a  term  is  reduced,  all  copies  of  it  are  also  reduced,  then  N- 
reductions  could  become  considerably  shorter.  In  fact,  it  is  shown  that  they  become 
minimal.  An  N-step  in  which  all  copies  of  a  term  are  replaced  is  called  an  NA-step.  Thus 
it  is  distinguished  from  an  N-step  in  which  only  a  single  copy  of  a  term  is  replaced.  A 
sequence  of  NA-steps  is  called  an  NA-derivation.  The  prefix  NA  stands  for  "normal-all". 
Minimality  yields  a  strong  form  of  laziness,  since  terms  are  simplified  with  minimum 
computational  effort. 

The  notion  of  a  copy  of  a  term,  however,  has  two  legitimate  interpretations.  The  first  is 
simply  that  any  two  occurrences  of  a  subterm  in  a  term  are  copies  of  each  other. 

The  second  is  obtained  from  examining  representations  of  terms  as  directed  acyclic 
graphs.  A  0-ary  function  symbol  f  is  represented  as  a  graph  consisting  of  just  a  single 
node  with  f  stored  in  it.  A  term  f(tl,..,tn)  is  represented  as  a  graph  whose  root  is  a  node 
with  n+1  fields.  The  first  field  stores  f  and  for  each  l=<i=<n,  the  ith  field  stores  a  pointer 
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to  the  graph  representation  of  ti.  A  term  can  have  many  graph  representations.  For 
example,  the  two  occurrences  of  a  in  f(a,a)  can  be  represented  by  a  single  graph,  or  by 
distinct  graphs.  Now,  two  occurrences  of  a  subterm  in  term  E  are  said  to  be  copies  of 
each  other  only  if,  in  the  graph  representation  of  E,  they  have  the  same  graph 
representation. 

We  adopt  the  second  interpretation  since  it  enables  us  to  develop  a  simple  proof  of 
minimality  and  also  to  implement  replacement  of  all  copies  of  a  subterm  with  a  small 
overhead.  Graph  representations  of  terms  are,  in  turn,  represented  using  labeled  terms. 
The  address  of  each  node  in  a  graph  is  represented  by  a  label.  Let  there  be  a  graph  G  with 
root  node  N.  Let  N  contain  m+1  fields,  where  the  first  field  contains  the  symbol  f  and  the 
rest  of  the  m  fields  contain  pointers  to,  respectively,  graphs  Gl,..,Gm.  Let  the  address  of 
N  be  represented  by  label  a.  Then  the  representation  of  G  is  the  labeled  term  f(a,tl,..,tm) 
where  for  each  i,  the  representation  of  Gi  is  the  labeled  term  ti. 

A  subset  of  DF*  called.  Labeled  Deterministic  F*  (LDF*),  is  defined.  Notions  of  labels 
[Vuillemin  1974],  labeled  terms,  ordinary  terms,  and  ordinary  programs  are  introduced. 
Reductions  in  LDF*  are  intended  to  mimic  graph  reduction.  In  particular,  it  is  ensured 
that  whei:  a  new  node  is  allocated  in  graph  reduction,  a  label  not  previously  used  in  the 
LDF*  reduction  is  generated. 

LDF*  is  shown  to  be  minimal  in  the  following  sense:  where  P*  is  an  LDF*  program,  and 
E  a  proper  term,  let  there  be  a  shortest  successful  reduction  of  E.  Then  there  is  a 
successful  NA-derivation  of  E  of  lesser  or  equal  length. 

It  is  also  shown  that  with  each  ordinary  DF*  program  P,  one  can  associate  an  LDF* 
program  P*,  such  that  if  there  is  a  successful  reduction  in  P,  there  is  a  successful 
reduction  in  P*  of  exactly  equal  length.  Hence,  to  simplify  terms  in  P  in  a  minimum 
number  of  steps,  it  is  sufficient  to  transform  P  to  P*  and  use  NA-derivations. 

Some  main  ideas  in  our  proof  are  (a)  in  each  reduction  step,  a  label  is  eliminated,  so  (b) 
the  size  of  the  elimination-set  (E-set)  of  a  reduction,  i.e.  the  set  of  labels  eliminated  in  the 
reduction,  is  a  lower-bound  on  its  length,  (c)  the  size  of  the  E-set  of  an  NA-derivation  of 
a  proper  term,  is  exactly  equal  to  its  length. 

IV.2  DEFINITION  OF  LDF* 

Labels.  Let  a,  P,  al,  (31,  £l,..  be  an  enumerably  infinite  subset  of  the  set  of  O-ary 
function  symbols  in  F*.  Each  member  of  this  list  is  called  a  primitive  label. 

Let  *  be  a  binary  function  symbol  in  F*.  A  label  is  defined  as  follows.  A  primitive  label 
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is  a  label.  If  x  and  y  are  labels  then  x*y  is  also  a  label.  A  label  a  is  said  to  be  a  proper 
initial  segment  of  label  (3  if  either  (3=a*8,  or  (3=^*8  and  a  is  a  proper  initial  segment  of  £. 

Labeled  terms.  Where  f  is  an  n+l-ary  function  symbol,  n>=0,  f**,  a  a  label  and  tl,..,tn 
labeled  terms,  f(a,tl,..,tn)  is  a  labeled  term,  a  is  called  the  outermost  label  of  f(a,tl,..,tn). 

For  example,  where  f  is  a  4-ary  function  symbol  and  a,b  are  1-ary  function  symbols, 
f(5,a(a),b((3),a(^))  is  a  labeled  term,  and  8  is  the  outermost  label  of  this  term.  Note  that  a 
label  standing  alone  is  not  a  labeled  term.  Neither  is  a  labeled  term  of  the  form  A*B. 

Also,  note  that  a  labeled  term  never  contains  any  variables. 

A  labeled  term  is  said  to  be  in  normal  form  if  it  contains  only  constructor  symbols  and 
labels. 

Maximal  labels.  A  label  is  maximal  in  a  labeled  term  if  it  is  not  a  proper  initial  segment 
of  any  other  label  in  that  term. 

Proper  terms.  A  labeled  term  E  is  called  proper  if  (a)  all  its  labels  are  maximal,  and  (b) 
for  every  two  subterms  A  and  B  of  E,  if  A  and  B  have  the  same  outermost  label  then 
A=B.  For  example,  f(a,b(£,),c(8))  is  a  proper  term.  However,  g(a,a(a*8))  is  not  a  proper 
term  since  it  violates  (a),  and  f(a,b(a))  is  not  a  proper  term  since  it  violates  (b). 

Ordinary  terms,  ordinary  rules,  and  ordinary  programs.  A  term  in  F*,  possibly 
containing  variables,  a  rule  in  F*,  or  an  F*  program,  is  said  to  be  ordinary  if  it  does  not 
contain  any  labels,  nor  any  occurrence  of  the  symbol  *. 

A  mapping  £.  Let  F  be  the  set  of  all  function  symbols  in  F*,  except  *,  and  the  primitive 
labels.  Let  there  be  an  injection  £  between  F  and  F  which  maps  each  n-ary  function 
symbol  in  F  to  an  n+l-ary  function  symbol  in  F.  Moreover,  £  always  maps  a  constructor 
symbol  to  a  constructor  symbol,  and  a  non-constructor  symbol  to  a  non-constructor 
symbol. 

Labeled  versions  of  ordinary  terms,  possibly  containing  variables.  Let  E  be  an 
ordinary  term,  possibly  containing  variables.  If  E  is  a  variable  then  its  labeled  version  is 
E  itself.  Otherwise,  let  E=f(tl,..,tn),  n>=0.  Its  labeled  version  is  f_(a,tl_,..,tnj  where  a 
is  a  label,  £(f)=f_,  and  for  each  i,  ti_  is  a  labeled  version  of  ti.  For  example,  where  £ 
maps  f  to  f_  and  a  to  a_,  a  labeled  version  of  f(a,a)  is  f_(ct,a_((3),a_(8)). 


Labeled  versions  of  ordinary  rules.  Let  LHS=>RHS  be  an  ordinary  F*  rule.  A  labeled 
version  of  this  rule,  LHS*=>RHS*,  is  defined  as  follows: 
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Let  LHS=f(Ll,..,Lm).  Then  LHS*=f_(L,Ll_,..,Lm_)  satisfying  the  following  conditions: 
(a)  f_=E(f),  (b)  L  is  a  variable,  (c)  If  Li  is  a  variable,  Li_=Li,  otherwise  Li=c(Xl,..,Xm), 
and  Li_=c_(Ki,X  1  ,..,Xm),  Ki  a  variable,  I(c)=c_,  and  (d)  a  variable  occurs  at  most  once 
in  LHS*. 

Let  RHS1  be  a  labeled  version  of  RHS  in  which  all  labels  are  distinct,  and  for  no  two  of 
these  is  one  a  proper  initial  segment  of  the  other.  Let  these  labels  be  (31,.., (3k.  Then, 
v'here  LHS*=f(L,Ll_,..,Lm_),  RHS*  is  obtained  by  replacing,  in  RHS1,  each  Pi  by  L*pi. 

Labeled  Deterministic  F*  Programs.  Let  P  be  an  ordinary  DF*  program,  and  let  P* 
consist  of  labeled  versions  of  rules  in  P.  Then  P*  is  called  a  labeled  deterministic  F* 
(LDF*)  program.  For  example,  where  P  consists  of: 

append(nil,X)=>X 

append(cons(U,V),W)=>consCU,append(V,W)) 
and  X  maps  append,  nil,  and  cons  to  append_,  nil_  and  cons_  respectively,  P*  consists  of: 
append_(L,nil_(L  1  ),X)=>X. 

append_(L,cons_(K  1  ,U,  V),W)=>cons_(L*  P 1  ,U,append_(L*  p2,  V,W)) 

where  pi,  and  P2  are  distinct  labels,  and  neither  is  a  proper  initial  segment  of  each  other. 
Note  that  each  LDF*  program  is  a  DF*  program  as  well  as  an  F*  program.  Also,  each 
labeled  term  is  an  F*  term.  Hence,  results  of  all  previous  sections  also  hold  for  LDF* 
programs  and  labeled  terms. 

NA-steps  and  NA-derivations.  Let  P  be  an  LDF*  program  and  E,G,H  be  labeled  terms. 
Suppose  selectp(E,G)  and  G=>pH.  Let  El  be  the  result  of  replacing  all  occurrences  of  G 
by  H  in  E.  Then  we  say  that  E  reduces  to  El  in  an  NA-step  in  P.  The  prefix  NA  in  N- 
step  stands  for  "normal-all".  A  sequence  of  labeled  terms  E0,E1,..  is  an  NA-derivation  if 
for  each  i,  when  Ei,  and  Ei+1  both  exist,  Ei  reduces  to  Ei+1  in  an  NA-step.  For  example, 
given  the  rule  a(L)=>b(L*a),  the  term  f(P,a(8),a(5))  reduces  in  an  NA-step  to  f(P,b(8*a), 
b(8*a)). 

Leftmost  steps  and  reductions.  Let  P  be  an  F*  program,  not  necessarily  ordinary.  Let 
E  be  a  term,  and  G  and  G 1  two  of  n.  subterms.  G  is  said  to  be  to  the  left  of  G 1  in  E,  if 
either  (a)  they  both  occur  at  the  same  position  in  E,  or  (b)  in  the  depth-first  or  preorder 
traversal  of  the  tree  representation  of  E,  the  function  symbol  which  is  the  root  of  G  occurs 
before  the  function  symbol  which  is  the  root  of  Gl. 


Let  select(E,G),  G=>H.  Then  E  reduces  to  F  in  a  leftmost  N-step,  if  for  every  Gl, 
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select(E,Gl)  implies  G  is  to  the  left  of  G1  in  E,  and  F=E[G/H]. 

Let  select(E,G),  G=>H.  Then  E  reduces  to  F  in  a  leftmost  NA-step,  if  for  every  Gl, 
select(E,Gl)  implies  G  is  to  the  left  of  Gl  in  E,  and  F  is  the  result  of  replacing  all 
occurrences  of  G  in  E  by  H. 

Definitions  of  leftmost  N-reductions  and  leftmost  NA-derivations  are  the  obvious  ones. 

Elimination  sets  or  E-sets.  Let  AO, A 1,.., An  be  a  reduction  where  each  Aj  is  a  labeled 
term.  For  any  i,  let  Ai+l=Ai[G/H]  where  G=f(a,tl,..,tm).  Then  we  say  that  the 
function-label  pair  (FL-pair)  <f,a>  has  been  eliminated  in  the  reduction  Ai,Ai+l.  The 
elimination  set  or  E-set  of  a  reduction  is  defined  as  the  set  of  all  FL-pairs  eliminated  in 
the  reduction.  Since  elimination  of  an  FL-pair  requires  one  reduction  step,  the  size  of  this 
set  (number  of  elements  in  it)  is  a  lower  bound  on  the  length  of  the  reduction  (the  number 
of  steps  in  it). 

Since  an  NA-step  can  be  thought  of  as  a  sequence  of  reductions  steps,  an  NA-derivation 
can  be  thought  of  as  a  reduction.  Hence,  the  E-set  of  an  NA-derivation  is  the  E-set  of  the 
corresponding  reduction. 

IV.3  MINIMALITY  OF  LDF* 

Let  P  be  an  LDF*  program  and  E  a  labeled  term.  We  already  know  from  completeness  of 
DF*  that  if  E  has  a  successful  reduction,  it  has  a  successful  N-reduction.  By  directedness 
of  DF*,  it  has  a  successful  leftmost  N-reduction.  We  now  show  the  following: 

(a)  If  E  has  a  successful  reduction  RO,  E  has  a  successful  N-reduction  R1  whose 
E-set  is  a  subset  of  the  E-set  of  RO. 

(b)  If  R1  and  R 2  are  two  successful  N-reductions  of  E,  their  E-sets  are  identical. 

(c)  If  E  has  a  successful  leftmost  N-reduction  R2,  it  has  a  successful  leftmost 
NA-derivation  R3.  Furthermore,  the  E-set  of  R3  is  a  subset  of  the  E-set  of  R2. 

(d)  If  E  is  a  proper  term,  the  size  of  the  E-set  of  any  NA-derivation  starting  at  E  is 
equal  to  the  number  of  NA-steps  in  that  derivation. 

(e)  Let  P  be  an  ordinary  DF*  program,  and  P*  its  labeled  version.  Let  EO  be  an 
ordinary  term,  and  EO*  a  labeled  version  of  EO.  Let  E0,E1,E2,...  be  a  reduction  in 
P.  Then  there  exists  a  reduction  E0*,E1*,E2*,...  in  P*,  such  that  for  each  i,  Ei*  is 
a  labeled  version  of  Ei. 
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Let  E  be  a  proper  term.  Let  RO  be  a  shortest  successful  reduction  of  E.  Then  its  length  is 
greater  than  or  equal  to  the  size  of  its  E-set.  By  (a),  there  exists  Rl,  an  N-reduction  of  E 
whose  E-set  is  a  subset  of  that  of  RO.  By  directedness  of  DF*,  there  exists  R2,  a 
successful  leftmost  N-reduction  of  E.  By  (b),  the  E-set  of  Rl  is  identical  to  that  of  R2. 

By  (c),  there  exists  R3,  a  successful  leftmost  NA-derivation  of  E  whose  E-set  is  a  subset 
of  that  of  R2.  By  (d),  the  length  of  this  NA-derivation  is  at  most  the  length  of  RO. 

Hence,  leftmost  NA-derivations  are  minimal  for  simplifying  proper  terms. 

Now,  let  P  be  an  ordinary  DF*  program  and  P*  its  labeled  version.  Let  there  be  a 
successful  reduction  R  of  a  term  E  in  P.  By  (e)  there  exists  a  successful  reduction  R*  of  a 
labeled  version  E*  of  E.  This  version  can  always  be  chosen  to  be  proper.  By  minimality 
of  LDF*,  there  is  a  successful  NA-derivation  starting  at  E*  of  length  less  than  or  emal  to 
that  of  R*  or  R.  Hence,  to  simplify  terms  in  a  minimum  number  of  steps,  it  is  suf  ncient  to 
transform  P  to  P*  and  use  leftmost  NA-derivations.  This  reasoning  is  now  carried  out 
formally  and  in  detail. 

IV.3.1  Existence  of  successful  leftmost  NA-derivations 

Lemma  1.  Let  P  be  an  LDF*  program.  Let  E0,F0  be  labeled  terms  such  that  E0->F0.  If 
EO  reduces  to  El  in  a  leftmost  N-step  then  there  exists  FI  such  that  E1-*>F1,  and  (a) 
either  F1=F0,  or  (b)  FO  reduces  to  FI  in  a  leftmost  N-step,  and  the  FL-pair  eliminated  in 
F0,F1  is  the  same  as  that  eliminated  in  E0,E1. 

Proof.  By  induction  on  length  of  EO.  We  can  draw  the  following  diagram: 

EO - >  FO 

I  I 

leftmost  |  F0=F1  or 

N-step  I  FO  reduces  to  FI  in  a  leftmost  N-step 

I  I 

El - *>  FI 

Case  1.  E0=f(a)  for  some  1-ary  function  symbol  f  and  label  a.  Since  E0->F0,  E0=>F0. 
So,  select(E0,E0)  and  due  to  restriction  (f),  E1=F0.  Take  F1=F0.  Clearly,  E1-*>F1. 

Case  2.  EO=f(a,tl,-..tm),  for  some  m+l-ary  function  symbol  f,  m>0,  label  a,  and  labeled 
terms  tl,..,tm. 

Case  2-1.  E0=>F0.  Similar  to  Case  1.  E1=F1=F0  and  E1-*>F1. 


Case  2-2.  Not  E0=>F0.  Then  F0=f(a,tl*,..,tm*),  and  there  exists  j  such  that  tj- 
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>tj*  and  for  each  i,  i*j  implies  ti=ti*. 

Suppose  E0=>E1.  Then  the  FL-pair  eliminated  in  E0,E1  is  <f,ot>.  It  is  easily 
verified  by  induction  on  length  of  EO,  that  F0=>F1  and  E1-*>F1.  Also,  FO 
reduces  to  FI  in  a  leftmost  N-step.  Finally,  the  FL-pair  eliminated  in  F0,F1  is  also 
<f,a>. 

Suppose  not  E0=>E1.  Then,  there  is  some  k,  such  that  El=f(tl,..,tk- 
l,sk,tk+l,..,tm),  and  tk  reduces  to  sk  in  a  leftmost  N-step.  By  induction 
hypothesis,  there  exists  sk*  such  that  sk-*>sk*,  and  either  tk*=sk*,  or  tk*  reduces 
to  sk*  in  a  leftmost  N-step,  and  the  FL-pair  eliminated  in  tk,sk  is  the  same  as  that 
eliminated  in  tk*,sk*. 

If  tk*=sk*,  let  F1=F0.  Clearly,  El -*>F1.  Otherwise,  let  Fl=f(oc,tl*,..,tk- 
l*,sk*,tk+l*,..,tm*).  Since  tk*  reduces  to  sk*  in  an  N-step,  tk*  is  not  simplified. 
Hence,  using  restriction  (g),  it  is  easily  verified  that  FO  reduces  to  FI  in  a  leftmost 
N-step.  In  particular,  in  this  step,  tk*  reduces  to  sk*  in  a  leftmost  N-step.  Hence, 
E1-*>F1,  and  the  FL-pair  eliminated  in  E0,E1  is  the  same  as  that  eliminated  in 
F0.F1.  QED. 

Lemma  2.  Let  P  be  an  LDF*  program.  Let  E0,F0  be  labeled  terms  such  that  E0-*>F0.  If 
EO  reduces  to  El  in  a  leftmost  N-step  then  there  exists  FI  such  that  E1-*>F1,  and  (a) 
either  F1=F0,  or  (b)  FO  reduces  to  FI  in  a  leftmost  N-step,  and  the  FL-pair  eliminated  in 
F0,F1  is  the  same  as  that  eliminated  in  E0,E1. 

Proof.  By  induction  on  length  of  the  reduction  E0,..,F0,  and  using  Lemma  1.  QED. 

Lemma  3.  Let  P  be  an  LDF*  program.  Let  E0,F0  be  labeled  terms  such  that  E0-*>F0  and 
let  there  be  a  successful  leftmost  N-reduction  E0,El,..,En.  Then  there  is  a  successful 
leftmost  NA-derivation  F0,Fl,F2,..,Fk  whose  E-set  is  a  subset  of  that  of  E0,El,..,En. 

Proof.  By  induction  on  length  n  of  E0,..,En,  and  using  Lemma  2.  QED. 

Theorem  1.  Let  P  be  an  LDF*  program.  Let  EO  be  a  labeled  term  and  EO,El,..,En  a 
successful  leftmost  N-reduction.  Then  there  is  a  successful  leftmost  NA-derivation 
E0,Fl,F2,..,Fk  whose  E-set  is  a  subset  of  that  of  E0,El,..,En. 


Proof.  Since  E0-*>E0,  apply  Lemma  3.  QED. 
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IV.3.2  E-sets  of  N-reductions 

Lemma  4.  Let  P  be  an  LDF*  program  and  E1,F1,G,H  be  labeled  terms  such  that: 

(a)  R(G,H,E1,F1),  and 

(b)  FI  reduces  to  F2  in  an  N-step,  and 

(c)  The  outermost  function  symbol  and  label  of  G  are,  respectively,  g  and  a,  and 

(d)  <r,P>  is  the  FL-pair  eliminated  in  the  reduction  F1,F2. 

Then  there  exists  an  N-reduction  E1,..,E2  such  that  its  E-set  is  included  in  {<g,a>,<r,(J>} 
and  R(G,H,E2,F2). 

Proof.  Exactly  parallel  to  proof  of  Theorem  1 ,  Section  H  QED. 

Lemma  5.  Let  P  be  an  LDF*  program.  Let  El,Fl,GJi,  be  labeled  terms  such  that 
R(G,H,E1,F1).  Let  the  outermost  function  symbol  and  label  of  G  be  g  and  a 
respectively.  Let  Fl,F2,..,Fm  be  a  successful  N-reduction.  Then  there  exists  a  successful 
N-reduction  El,..,En  whose  E-set  is  contained  in  the  union  of  {<g,a>}  and  the  E-set  of 
Fl,F2,..,Fm. 

Proof.  By  induction  on  length  of  Fl,..JFm.  QED. 

Lemma  6.  Let  El,Fl,G2,..,Gm  be  a  successful  reduction.  Then  there  is  a  successful  N- 
reduction  El,..,En  such  that  the  E-set  of  El,..,En  is  contained  in  that  of  El,Fl,G2,..,Gm. 

Proof.  By  induction  on  length  of  El,Fl,G2,..,Gm,  and  Lemma  5.  QED. 

Lemma  7.  Let  P  be  an  LDF*  program.  Let  EO  be  a  labeled  term  and  E0JEl,..,En  and 
E0JF1,.. ,Fp  two  successful  N-reductions.  Then,  the  E-set  of  one  is  identical  to  that  of  the 
other. 

Proof.  Exactly  analogous  to  the  proof  of  Lemma  9,  Section  ID,  that  any  two  successful 
N-reductions  of  a  term  are  of  equal  length,  and  end  in  the  same  simplified  form.  QED. 

rv.3.3  Reductions  of  proper  terms 

Lemma  8.  Let  P  be  an  LDF*  program.  Let  E  be  a  proper  term  and  let  E  reduce  to  F  in  an 
NA-step.  Then  all  labels  of  F  are  maximal. 

Proof.  Let  select(E,G).  Then  G=>H  and  F  is  obtained  by  replacing  all  occurrences  of  G 
in  E  by  H.  Let  the  rule  by  which  G=>H  be  LHS=>RHS,  and  let  G=g(a,tl,..,tm),  m>=0, 
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and  each  ti  a  labeled  term.  Take  any  two  labels  p  and  £  in  F.  There  are  four  cases. 

Case  1.  P  and  %  are  both  in  E.  Since  E  is  proper,  these  labels  are  not  proper  initial 
segments  of  each  other. 

Case  2.  Only  P  is  in  E.  Then,  by  the  nature  of  LDF*  rules,  ^=a*5  for  some  label  8  in 
RHS.  Since  E  is  proper,  P  and  a  are  not  proper  initial  segments  of  each  other.  If  P*a 
then  P  and  a*S  are  also  not  proper  initial  segments  of  each  other. 

Suppose  p=a.  Since  P  occurs  in  E,  E  has  a  subterm  f(P,sl,..,sn),  n>=0.  Since  E  is 
proper,  f(P,sl,..,sn)=G.  But  since  all  occurrences  of  G  are  replaced  by  H,  P  cannot  occur 
in  F,  as  assumed.  Hence  this  subsubcase  cannot  arise. 

Case  3.  Only  £,  is  in  E.  Same  as  case  2. 

Case  4.  None  of  P  and  ^  is  in  E.  Then,  by  definition  of  LDF*  rules,  P=a*8  and  £=a*e, 
for  some  labels  5  and  £  in  RHS.  Since  8  and  £  are  not  proper  initial  segments  of  each 
other,  neither  are  P  and  QED. 

Lemma  9.  Let  P  be  an  LDF*  program.  Let  E  be  a  proper  term  and  let  E  reduce  to  F  in  an 
NA-step.  Let  A  and  B  be  two  subterms  of  F  such  that  the  outermost  label  of  A  and  of  B 
is  p.  Then,  A=B. 

Proof.  Let  select(E,G).  Then  G=>H  and  F  is  obtained  by  replacing  all  occurrences  of  G 
in  E  by  H.  Let  the  rule  by  which  G=>H  be  LHS=>RHS,  and  let  the  outermost  label  of  G 
be  a.  There  are  four  cases. 

Case  1.  A,  but  not  B,  is  a  subterm  of  H.  Let  the  label  of  A,  and  of  B  be  p.  Since  B  is  not 
a  subterm  of  H,  there  occurs  B 1  in  E,  with  label  p,  such  that  B  is  the  result  of  replacing 
all  occurrences  of  G  in  B1  by  H. 

Since  A  is  a  subterm  of  H,  P  occurs  in  H.  However,  p;*a*S  since  a,  and  P  both  occur  in 
E,  and  E  is  proper.  Hence,  P  occurs  in  G.  But  then,  since  E  is  proper,  B  cannot  properly 
contain  G.  Hence  B1=B,  so  B  occurs  in  E. 

Now  P  is  also  the  label  of  A,  P*a*8,  and  A  occurs  in  H.  Hence  A  occurs  in  G,  and  so  in 
E.  Since  E  is  proper,  A=B,  as  required. 

Case  2.  B,  but  not  A,  is  a  subterm  of  H.  Then,  as  in  the  previous  case,  B  occurs  in  E. 
Hence  A=B. 
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Case  3.  Both  A  and  B  are  subterms  of  H.  Then  A  and  B  are  also  contained  in  G. 

Suppose  A  is  not,  but  B  is.  Then,  P=a*8.  Since  B  occurs  in  G,  [3  also  occurs  in  E. 
Contradiction  with  E  is  proper.  Similarly,  for  A  in  G,  but  not  B.  Suppose  none  of  A  and 
B  are  in  G.  Without  loss  of  generality  assume  A  and  B  occur  at  distinct  positions.  Then, 
there  must  be  distinct  labels  e  and  <|>  in  RHS  such  that  P=a*e  and  p=a*<|>.  But  this  implies 
e=4>  which,  by  the  nature  of  labeled  rules,  is  impossible.  Hence  both  A  and  B  are 
contained  in  G,  and  hence  in  E.  Since  E  is  proper,  A=B. 

Case  4.  None  of  A  and  B  is  a  subterm  of  H.  Hence,  there  exist  terms  A1  and  B1  in  E 
such  that  A  is  obtained  by  replacing  all  occurrences  of  G  in  A1  by  H  and  B  is  obtained 
from  B1  similarly.  Since  A*H,  B*H,  the  outermost  label  of  A1  and  B1  is  also  p.  Since  E 
is  proper  A1=B1.  Hence  A=B.  QED. 

Lemma  10.  Let  P  be  an  LDF*  program.  Let  a  proper  term  E  reduce  to  F  in  an  NA-step. 
Then  F  is  a  proper  term. 

Proof.  By  Lemmas  8  and  9,  F  is  a  proper  term.  QED. 

Lemma  11.  Let  P  be  an  LDF*  program.  Let  E0  be  a  proper  term.  Let  E0,El,..,Ek  be  an 
NA-derivation.  Then,  in  this  reduction,  an  FL-pair  is  eliminated  at  most  once. 

Proof.  By  induction  on  length  k  of  E0,El,..JEk.  If  k=0,  then  clear.  Otherwise,  assume 
the  theorem  for  El,..,Ek.  Let  select(E0,G),  G=>H  and  let  El  be  obtained  by  replacing  all 
occurrences  of  G  in  E0  by  H.  Let  the  outermost  function  symbol  of  G  be  f  and  its 
outermost  label  be  a.  Hence,  <f,a>  is  the  pair  eliminated  in  the  reduction  E0.E1. 

If  we  can  show  that  there  is  no  term  f(a,tl,..,tm)  in  El,..,Ek,  then,  by  induction 
hypothesis,  we  can  conclude  that  no  FL-pair  is  eliminated  more  than  once  in  E0,El,..J5k. 
To  show  this,  it  is  sufficient  to  show  that  the  label  a  never  occurs  in  El,..,Ek.  Since  E0  is 
proper,  and  <f,ot>  is  eliminated,  a  does  not  occur  in  El.  Let  %  be  a  label  in  E2,..,Ek. 
Then,  either  £=p  for  some  label  P  in  El  in  which  case  £*a.  Otherwise,  £=P*e  for  some 
label  P  in  El  and  label  e.  We  show  that  it  is  not  possible  that  P*£=a. 

Case  1.  P  occurs  in  E0.  Since  E0  is  proper,  p  is  not  a  proper  initial  segment  of  a.  Hence, 
it  is  not  possible  that  p*e=a. 

Case  2.  P  does  not  occur  in  E0.  Then,  by  the  nature  of  LDF*  rules,  P=a*8,  for  some 
label  8.  Hence,  p*e  is  longer  than  a,  and  so  p*e*a.  QED. 

Theorem  2.  Minimality  of  LDF*.  Let  P  be  an  LDF*  program.  Let  E0  be  a  proper  term. 
Let  E0,El,..,Ek  be  a  successful  reduction.  Then  there  exists  a  successful  leftmost  NA- 
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derivation  E0,Fl,..,Fm  such  that  m=<k. 

Proof.  Since  E0,El,..,Ek  is  a  successful  reduction,  by  reduction-completeness  for  F*, 
Theorem  2,  Section  n,  there  exists  a  successful  N-reduction  E0,Gl,..,Gn.  Let  the  E-set  of 
E0,El,..,Ek  be  SI  and  that  of  E0,Gl,..,Gn  be  S2.  Then,  by  Lemma  6,  S2  is  a  subset  of 
SI.  By  directionality  of  DF*,  there  exists  a  successful  leftmost  N-reduction  E0,Hl,..,Hn. 
By  Lemma  7,  its  E-set  is  also  S2. 

By  Theorem  1  above,  there  exists  a  successful  leftmost  NA-derivation  E0,Fl,..,Fm  whose 
E-set,  S3,  is  a  subset  of  S2.  Hence  S3  is  a  subset  of  S 1 .  By  Lemma  1 1 ,  the  size  of  S3  is 
m.  The  size  of  SI  is  a  lower  bound  on  the  number  of  steps  in  E0,El,..,Ek.  Hence  m=<k. 
QED. 

IV .4  EXTENSION  OF  MINIMALITY  RESULT  TO  NORMAL  FORMS 

We  have  shown  that  leftmost  NA-derivations  reduce  proper  terms  to  simplified  forms  in  a 
minimum  number  of  steps.  It  appears  to  be  straightforward  to  extend  this  result  to  normal 
forms. 

E  reduces  to  F  in  an  NAR-step  if  select-r(E,p),  p=>q  and  F  is  the  result  of  replacing  each 
occurrence  of  p  in  E  by  q.  Definitions  of  NAR-derivations  and  leftmost  NAR-derivations 
are  the  obvious  ones.  The  proof  that  leftmost  NAR-reductions  reduce  proper  terms  to 
normal  forms  in  a  minimum  number  of  steps  appears  to  be  very  similar  to  the  above 
proof. 

IV.5  DERIVED  MINIMALITY  OF  DF* 

Lemma  12.  Let  P  be  a  DF*  program  and  EO  a  term,  where  both  P  and  EO  are  ordinary. 
Let  P*  and  EO*  be,  respectively,  their  labeled  versions.  Let  EO-p>El.  Then  there  exists 
El*  such  that  E0*-p*>El*  and  El*  is  a  labeled  version  of  El. 

Proof.  There  exist  G,H  such  that  E1=E0[G/H].  Proceed  by  induction  on  length  of  EO.  If 
EO  is  a  O-ary  function  symbol  g,  then  clear. 

Otherwise,  EO=f(tl,..,tm),  m>0.  Then  EO*=f*(a,tl*,..,tm*)  where  E(f)=f*  and  for  each  i, 
0=<i=<m,  ti*  is  a  labeled  version  of  ti.  Assume  lemma  for  each  of  tl,..,tm. 

Suppose  G  occurs  in  some  ti  in  tl,..,tm,  di=ti[G/H]  and  El=f(tl,..,ti-l,di,ti+l,..,tm).  Then, 
by  induction  hypothesis,  there  exists  di*  such  that  ti*->di*  and  di*  is  a  labeled  version  of 
di.  Let  El*=f*(a,tl*,..,ti-l*,di*,ti+l*,..,tm*).  Clearly,  El*  is  a  labeled  version  of  El. 
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Suppose  G=E0.  Then  there  is  a  rule  f(Ll,..,Lm)=>RHS  such  that  EO  matches  f(Ll,..,Lm) 
with  some  substitution  O  and  H=RHSO.  Let  a  labeled  version  of  this  rule  be 
f*(L,Ll*,..JLm*)=>RHS*.  It  is  easily  verified  that  EO*  matches  the  head  of  this  rule  with 
substitution  <X>*  such  that  <L,a>  is  in  d>*  and  for  each  pair  <X,t>  in  O,  the  pair  <X,t*>  is 
in  d>*  where  t*  is  a  labeled  version  of  t.  It  is  also  easily  verified  that  RHS*<1>*  is  a  labeled 
version  of  RHSd>.  QED. 

Theorem  3.  Derived  minimality  for  DF*.  Let  P  be  a  DF*  program  and  EO  a  term, 
where  both  P  and  EO  are  ordinary.  Let  P*  and  EO*  be,  respectively,  their  labeled  versions 
such  that  EO*  is  proper.  Let  E0,El,..,Ek  be  a  successful  reduction  in  P.  Then  there  exists 
a  successful  NA-derivation  E0*,Fl*,..,Fp*  in  P*  such  that  p=<k. 

Proof.  We  can  ensure  that  EO*  is  a  labeled  version  of  EO  which  is  proper,  simply  by 
choosing  distinct  maximal  labels  for  function  symbols  of  EO.  By  Lemma  12,  there  exists 
a  successful  reduction  E0*,El*,..,Ek*  such  that  for  each  i,  Ei*  is  a  labeled  version  of  Ei. 
Since  P*  is  an  LDF*  program,  and  EO*  is  proper,  by  Theorem  2,  there  exists  a  successful 
NA-derivation  E0*,Fl*,..,Fp*  such  that  p=<k.  QED. 
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V.  COMPILATION  OF  F*  INTO  HORN  CLAUSES 
V.l  INTRODUCTION 

A  very  simple  algorithm  is  described,  which  compiles  F*  programs  into  Horn  clauses  in 
such  a  way  that  when  SLD-resolution  interprets  them,  it  directly  simulates  the  behavior  of 
select  This  is  accomplished  by  compiling  each  F*  rule  into  a  distinct  Horn  clause,  and 
combining  in  that  clause,  information  about  the  logic  of  the  rule,  and  information  about 
the  control  of  select  when  interpreting  that  rule.  Thus,  a  specialized  interpreter  is 
produced  for  each  rule. 

If  the  F*  program  satisfies  restriction  (g)  in  Section  III.2,  the  clauses  resulting  from  its 
translation  can  be  transformed  to  eliminate  all  redundant  backtracking.  If  the  program 
also  satisfies  restriction  (f),  i.e.  is  in  DF*,  SLD-search  trees  automatically  contain  exactly 
one  branch.  All  the  time,  however,  only  pure  clauses  are  produced. 

The  nature  of  logical  variables  is  utilized  to  implement  the  assumption  necessary  for 
minimality.  This  is  that  when  a  term  is  reduced,  all  copies  of  it  are  simultaneously 
reduced.  A  logical  variable  has  the  property  that  when  one  occurrence  of  it  in  a  term  is 
bound  to  some  term,  all  occurrences  of  it  are  simultaneously  bound  to  the  same  term. 
Unfortunately,  use  must  now  be  made  of  a  metalogical  feature  (var),  and  an  extra  logical 
feature  (cut).  This  is  the  only  impure  aspect  in  the  entire  LOG(F)  system.  Consequently, 
SLD-resolution,  augmented  with  these  features,  computes  NA-reductions. 

LOG(F)  is  defined  to  be  a  logic  programming  system  augmented  with  an  F*  compiler, 
and  the  equality  axiom  X=X.  A  ready-made  implementation  of  LOG(F)  is  obtained  by 
implementing  the  F*  compiler  in  Prolog  and  using  Prolog  in  place  of  SLD-resolution. 

Due  to  its  depth-first  search  strategy,  Prolog  may  sometimes  not  be  able  to  simplify  terms, 
even  though  select  would.  However,  ifP  is  in  DF*,  Prolog  always  simplifies  terms 
whenever  select  does. 

In  all  of  the  following,  except  in  Section  V.6,  Prolog  clauses  and  Prolog  are  synonymous 
with  Horn  clauses  and  SLD-resolution.  Only  in  Section  V.6  do  they  refer  to  the 
programming  language. 

V.2  COMPILATION  ALGORITHM 

Let  P  be  an  F*  program.  The  compilation  of  P  into  Prolog  proceeds  in  two  stages. 


Stage  1.  For  each  n-ary,  n>=0,  constructor  symbol  c  in  P,  and  where  Xl,..,Xn  are  distinct 
variables,  generate  the  clause: 
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reduce(c(X  1 ...  ,Xn),c(X  1  ,..,Xn)) 

Stage  2.  Let  f(Ll,..,Lm)=>RHS  be  a  rule  in  P  where  f  is  an  m-ary,  m>=0,  non¬ 
constructor  function  symbol  and  each  of  RHS  and  Ll,..,Lm  is  a  term,  possibly  containing 
variables.  For  each  such  rule  perform  the  following  steps: 

(a)  Let  Al,..,Am  be  distinct  Prolog  variables  none  of  which  occur  in  the  rule.  If 
Li  is  a  variable  let  Qi  be  Ai=Li.  If  Li  is  c(Xl,..,Xn)  where  c  is  a  constructor 
symbol,  and  each  Xi  a  variable,  let  Qi  be  reduce(Ai,c(Xl,..,Xn)). 

(b)  Let  Out  be  a  Prolog  variable  not  occurring  in  the  rule,  and  different  from 
Al,..,Am.  Generate  the  predication  reduce(RHS.Out). 

(c)  Generate  the  clause: 

reduce(f(  A 1 ,. .,  Am),Out)  :-Ql ...  ,Qm,reduce(RHS  ,Out). 

For  example  the  F*  rules: 
append([]  ,X)=>X 

append([UIV],W)=>[Ulappend(V,W)] 
intfrom(N)=>[Nlintfrom(s(N))] . 
if(true,X,Y)=>X. 
if(false,X,Y)=>Y. 

are  compiled  into: 

reduce([],[]). 
reduce([UIV],[UIV]). 
reduce(true,true). 
reduce(false, false). 

reduce(append(A  1  ,A2),Out)  :-reduce(A  1 ,[]),  A2=X  ,reduce(X,Out). 
reduce(append(A  1  ,A2),Out):- 

rcduce(A  1  ,[UIV]),A2=W,reduce([Ulappend(V,W)],Out). 
reduce(intfrom(N),Out):-reduce([Nlintfrom(s(N))],Out). 
reduce(if(T,X,Y),Out):-reduce(T, true), reduce(X, Out). 
reduce(if(T,X,Y),Out):-reduce(T,false),reduce(Y,Out). 

It  can  be  seen  that  where  reduce(f(Al,..,Am),Out):-Ql,..,Qm,reduce(RHS,Out)  is  the 
translation  of  f(Ll,..,Lm)=>RHS,  Ql,..,Qm  represent  the  attempt  to  match  some  term 
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f(tl,..,tm)  with  f(Ll,..,Lm).  If  these  succeed,  the  match  succeeds  with  some  substitution 
a.  Now,  reduce(RHS,Out)  represents  simultaneously,  application  of  a  to  RHS,  and 
recursive  simplification  of  RHSa.  The  correctness  of  compilation  algorithm  is  formally 
proved  in  Section  V.7. 

In  practice,  in  stage  2(a)  if  Li  is  a  variable,  then  Ai  in  f(Al,..,Am)  is  replaced  by  Li,  and 
Ai=Li  is  not  generated.  This  eliminates  a  procedure  call,  and  so  yields  substantially  faster 
code.  However,  proofs  of  propositions  below  are  easier  to  derive  without  this 
optimization. 

V.3  COMPUTING  AND  PRINTING  NORMAL  FORMS 

If  there  is  a  method  to  compute  simplified  forms  of  terms,  it  can  be  applied  repeatedly  to 
compute  normal  forms  of  terms.  This  is  guaranteed  by  reduction-completeness  for 
normal  forms,  Theorem  4,  Section  II.  In  particular,  for  each  m-ary  constructor  symbol  we 
can  add  the  following  rule: 

nf(E,c(Xl,..,Xm)):-reduce(E,c(Tl,..,Tm)),nf(Tl,Xl),..,nf(Tm,Xm). 

Now,  to  compute  the  normal  form  of  a  term  E,  we  can  execute  nf(E,X),  where  X  is  a 
variable.  The  correctness  of  this  rule  for  computing  normal  forms  can  easily  be  proved 
from  the  arguments  of  Section  V.7. 

Clearly,  computing  normal  forms  is  only  sensible  when  they  are  finite.  If  they  are  not,  we 
can  at  least  print  finite  portions  of  them  as  they  are  generated.  For  example,  we  can  print 
members  of  an  infinite  list  as  follows: 

print_list(X):-reduce(X,[UIV]),write(U),write(’  ’),print_list(V). 

V.4  OPTIMIZING  RULES  SATISFYING  RESTRICTION  (g) 

Let  P  be  an  F*  program  and  PC  its  compiled  version.  Let  f(tl  l,..,tli,..,tlm)=>RHSl, .., 
f(tnl,..,tni,..,tnm)=>RHSn  be  the  n  rules  defining  f  in  P,  and  Cl,..,Cn  be,  respectively, 
their  compiled  versions.  Let  the  rules  satisfy  restriction  (g),  Section  III.2.  Then,  if  tli  is  a 
variable,  the  ith  literal  in  bodies  of  Cl,..,Cn  will  be,  respectively,  Ai=tli,..,Ai=tni,  for 
some  variable  Ai.  Otherwise,  the  ith  literals  in  Cl,..,Cn  would  be,  respectively, 
reduce(Ai,tli),..,reduce(Ai,tni). 

If  tli  is  not  a  variable,  the  query  reduce(f(al,..,ai,..,am),Z)  may,  due  to  backtracking, 
cause  evaluation  of  each  of  reduce(ai,tli),..,reduce(ai,tni).  We  can  ensure  that  reduce  is 
called  just  once  for  ai  by  taking  advantage  of  the  fact  that  all  reduce  clauses  have  the 
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same  form.  That  is,  we  can  collapse  them  all  into  the  single  clause: 

reduce(f(A  l,..,Am),Z):-Rl  ,..,Rm,f(X  1  ,..,Xm)=>RHS,reduce(RHS,Z). 

where  Xl,..,Xm  are  distinct  variables  not  occurring  in  any  of  the  clauses,  and  if  tli  is  a 
variable,  Ri  is  Ai=Xi,  otherwise  Ri  is  reduce(Ai,Xi).  Now  reduce  would  be  called  just 
once  for  ai.  Of  course,  the  =>  rules  now  need  to  be  included  with  the  reduce  clauses. 
Thus  Prolog  execution  can  be  considerably  speeded  up. 

Furthermore,  if  P  is  a  DF*  program  then  f(Xl,..,Xm)=>RHS  will  succeed  at  most  once. 
Hence,  for  any  ground  terms  tl,..,tm,  and  variable  Z,  the  search  tree  rooted  at 
reduce(f(tl,..,tm),Z)  will  contain  exactly  one  branch.  Thus,  the  reduce  clauses  would  form 
a  deterministic  logic  program.  For  example,  consider  the  DF*  program: 

append([],X)=>X. 

append([UIV],W)=>[Ulappend(V,W)]. 

Its  compiled  version,  excluding  rules  for  constructor  symbols,  is: 

reduce(append(A  1  ,A2),Z):-reduce(A  1  ,[]),A2=X  ,reduce(X,Z). 
reduce(append(  A 1 ,  A2)  ,Z)  :- 

reduce(A  1  ,[UIV]),A2=W,reduce([U  lappend(V,W)],Out). 

These  two  rules  can  be  collapsed  into  a  single  one: 
reduce(append(  A 1  ,A2),Z) :- 

reduce(A  1»X1)  ,A2=X2,append(X  1  ,X2)=>RH  S  ,reduce(RHS,Z). 

Now,  given  the  query  reduce(append([l],[2]),Z),  an  attempt  would  be  made  to  simplify 
[1]  just  once,  and  not  twice,  as  with  the  original  pair  of  reduce  clauses.  Also,  since  the 
append  rules  are  in  DF*,  the  SLD-search  tree  rooted  at  reduce(append([l],[2]),Z) 
contains  exactly  one  branch. 

V.5  COMPUTING  FUNCTIONS  EAGERLY  IN  F* 

If  a  function  is  defined  in  F*,  it  is  computed  lazily.  Often  it  is  very  desirable  that  some 
functions,  such  as  arithmetic  functions,  be  computed  eagerly.  We  show  one  way  to 
accomplish  this. 

A  lazy  function  symbol  is  one  which  is  defined  in  F*.  An  eager  function  symbol  is  one 
which  is  defined  in  Prolog.  Only  right  hand  sides  of  F*  rules  can  contain  calls  to  eager 
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functions.  Let  E  be  a  subterm,  possibly  containing  variables,  of  the  right  hand  side  of  an 
F*  rule.  Let  the  outermost  function  symbol  of  E  be  eager.  Then  E  must  not  contain  any 
lazy  function  symbol.  For  example,  where  length  is  eager,  and  append  is  lazy,  the  term 
length(append([],[l]))  must  not  appear  in  any  F*  rule. 

Now,  let  LHS=>RHS  be  an  F*  rule,  f  an  eager  function,  and  f(tl,..,tn)  a  subterm,  possibly 
containing  variables,  of  RHS.  Let  f  be  defined  by  an  n+1  ary  predicate  symbol 
p(Al,..,An,A),  such  that  Al,..,An  are  input  positions  and  A  the  output  position.  Let 
RHS1  be  the  result  of  replacing  f(tl,..,tn)  in  RHS  by  X,  where  X  is  a  variable  not 
occurring  in  LHS=>RHS.  Generate  the  condition  p(tl,..,tn,X),  and  add  it  to  the 
conditions  generated  in  Stage  2  (a)  of  Section  V.2.  Of  course,  if  tl,..,tn  themselves 
involve  calls  to  eager  functions,  they  must  be  treated  similarly.  For  example,  let  multiple 
be  an  eager  function  defined  in  Prolog  as  follows: 

multiple(A,B,true):-0  is  A  mod  B. 
multiple(A,B, false): -not(0  is  A  mod  B). 

Now  the  rule: 

filter(A,[UIV])=>if(multiple(U,A),filter(A,V),[Ulfilter(A,V)]). 

is  compiled  into: 

reduce(filter(A,X),Z):- 

reduce(X,[UIV]), 

multiple(U,A,T), 

reduce(if(T,filter(A,V),[Ulfilter(A,V)]),Z). 

However,  some  care  still  needs  to  be  exercised.  For  example,  where  zerop  and  /  are  eager 
functions,  defined  in  Prolog  by,  respectively,  zerop  and  div,  the  rule: 

f(X)=>if(zerop(X),[X],[  1/X]). 

will  be  compiled  into: 

reduce(f(X),Z):-zerop(X,T),div(l,X,A),reduce(if(T,[X],[A]),Z). 

Now,  if  X  is  0,  the  call  to  div  will  cause  an  unintended  division  by  0.  But  one  can  rewrite 
the  above  rule  as: 


f(X)=>if(zerop(X),[X],h(X)). 
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h(X)=>[l/X]. 

V.6  COMPILING  LDF*  PROGRAMS 

We  now  show  how  to  represent  labeled  terms  in  Prolog,  and  compile  LDF*  programs 
into  Prolog  in  such  a  way  that  NA-steps  can  be  performed  efficiently.  The  main  idea  is 
that  labels  can  be  represented  by  logical  variables.  These  have  the  property  that  if  one 
occurrence  of  a  variable  in  term  E  is  bound  to  term  F,  all  occurrences  of  the  variable  in  E 
are  simultaneously  bound  to  F. 

Let  E  be  a  proper  term  and  let  E  reduce  to  F  in  an  NA-step.  Then  there  is  a  subterm  G  of 
E  such  that  G=>H,  and  F  is  obtained  by  replacing  all  occurrences  of  G  in  E  by  H.  Note 
that  each  of  G,H,F  is  proper.  Let  E  contain  the  labels  al,..,an.  Let  Vl,..,Vn  be  distinct 
variables  and  E*  the  result  of  replacing  for  each  i,  all  occurrences  of  ai  in  E  by  Vi.  Then 
E*  is  a  Prolog  representation  of  E.  Similarly,  let  G*,H*,F*  be  Prolog  representations  of 
G,H,F  respectively  such  that  H*  and  E*  do  not  have  any  variables  in  common.  Then 
G*=f(V,tl,..,tm)  where  V  is  a  variable.  If  we  now  bind  V  to  H*,  all  occurrences  of  V  in 
E*  are  bound  to  H*.  Let  the  result  be  FI*. 

Now,  before  attempting  to  match  a  term  with  a  non-variable  term,  we  take  the  precaution 
of  checking  whether  its  label  is  already  bound  to  some  term.  If  so,  we  attempt  to  match 
this  term  with  the  non-variable  term.  Otherwise,  we  proceed  as  usual.  Thus,  after  V  has 
been  bound  to  H*,  if  another  occurrence  of  G*  is  to  be  matched  with  some  term,  we 
attempt  to  match  H*  with  it. 

At  a  later  stage  it  is  possible  that  the  label  of  H*  itself  be  bound  to  a  term.  Thus,  before 
matching  a  term,  it  may  be  necessary  to  "dereference"  its  label  a  number  of  times.  It  is 
not  unreasonable  to  assume  that  the  cost  of  dereferencing  is  small  compared  to  that  of 
reduction.  Thus,  we  can  work  with  FI*  instead  of  F*,  so  replacement  of  all  occurrences 
of  a  term  is  implemented  efficiently.  Moreover,  F*  can  be  obtained  from  FI*  by 
dereferencing.  The  algorithm  for  compiling  LDF*  programs  can  be  found  in  [Narain 
1988].  In  practice,  DF*  programs  can  be  compiled  directly  into  reduce  clauses  with 
labels,  without  first  transforming  them  into  LDF*  programs.  The  appropriate  algorithm 
can  easily  be  worked  out. 

It  will  be  recognized  that  our  scheme  for  implementing  NA-derivations  is  exactly  the 
graph-reduction  scheme  with  indirection  nodes  described  in  [Turner  1979]  and 
[O’Donnell  1982],  However,  in  our  case  it  is  possible  to  ensure  that  the  length  of  the 
dereferencing  chain  is  exactly  one.  Details  can  be  obtained  in  [Narain  1988].  For 
example,  where  nil  is  a  zero-ary  constructor  symbol,  let  P  be  the  following  DF*  program: 
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merge(nil,nil)=>nil. 

double(X)=>merge(X,X). 

h=>d. 

This  is  compiled  into: 

(1)  reduce(merge(V,Al,A2),Z):-not  var(V),reduce(V,Z),!. 

(2)  reduce(double(V,Al),Z):-not  varCV),reduce(V,Z),!. 

(3)  reduce(h(V),Z):-not  var(V),reduce(V,Z),!. 

(4)  reduce(nil(N),nil(N)). 

(5)  reduce(merge(Z,Al,A2),Z):- 

reduce(Al,nil(Nl)),reduce(A2,nil(N2)),reduce(nil(N3),Z). 

(6)  reduce(double(Z,A  1  ),Z)  :-reduce(merge(N,A  1 ,  A 1  ),Z). 

(7)  reduce(h(Z),Z):-reduce(d(D),Z). 

Consider  the  query  reduce(double(A,h(B)),Z),  which  has  as  descendant 
reduce(merge(N,h(B),h(B)),Z).  Suppose  the  first  call,  reduce(h(B),nil(Nl)),  in  (5) 
succeeds,  but  only  after  a  long  and  complicated  deduction.  Then  B  is  bound  to  nil(Nl). 
Now,  due  to  (3),  the  second  call,  reduce(h(B),nil(N2)),  in  (5)  will  terminate  in  just  three 
inference  steps.  The  cut  (!)  will  prevent  (7)  from  being  tried  all  over  again.  Also  note 
that  of  these  three  inference  steps,  only  one  is  a  dereferencing  step.  The  number  of  these 
steps  is  constant,  regardless  of  the  function  definitions. 

V.7  CORRECTNESS  OF  F*  COMPILATION  ALGORITHM 

Lemma  1.  Let  P  be  an  F*  program.  If: 

(1)  EO=f(tl,..,ti,..,tm),  and 

(2)  Ek=f(sl,..,si,..,sm),  and 

(3)  si  is  simplified,  and 

(4)  E0,..,Ek,  k>=0,  is  an  N-reduction  such  that  for  no  i,  Ei=>Ei+l. 

Then  there  is  a  successful  N-reduction  ti,..,si  of  length  less  than  or  equal  to  the  length  k  of 
E0,El,..,Ek. 

Proof:  By  Lemma  8,  Section  III.  QED. 

Lemma  2.  Let  P  be  an  F*  program,  and  PC  its  compiled  version.  Let  A  be  a  ground  term 
and  B  a  term,  possibly  containing  variables,  such  that  reduce(A,B)  succeeds,  in  the  sense 
of  SLD-resolution,  with  answer  substitution  o.  Then  Ba  is  ground. 
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Proof:  By  induction  on  length  n  of  successful  SLD-derivation  reduce(A,B),Gl,..,Gn=Q 

QED. 

Lemma  3.  Let  P  be  an  F*  program  and  PC  its  compiled  version.  Let  A  and  B  be  ground 
terms  such  that  reduce(A,B)  succeeds.  Let  D  be  a  term,  possibly  containing  variables, 
such  that  for  some  substitution  a,  Da=B.  Then  reduce(A,D)  succeeds  with  answer 
substitution  a. 

Proof:  By  induction  on  length  n  of  successful  SLD-derivation  starting  at  reduce(A,B). 

QED. 

Lemma  4.  Let  P  be  an  F*  program.  Let  PC  be  the  compiled  version  of  P.  Let  E0,..JEn  be 
a  successful  N-reduction.  Then  reduce(EO,En)  succeeds  in  the  presence  of  PC. 

Plan  of  Proof:  By  induction  on  length  of  successful  N-reduction  EO,..  JEn.  We  show  that 
there  is  some  Ej,  j>0,  in  E0,..,En  such  that  an  SLD-derivation  of  reduce(EO,En)  contains 
the  goal  reduce(Ej,En).  Since  Ej,..,En  is  also  a  successful  N-reduction,  by  induction 
hypothesis,  reduce(Ej,En)  succeeds.  Hence  reduce(EO,En)  succeeds. 

Proof:  By  induction  on  length  n  of  successful  reduction  EO,..,En.  If  n=0  then  EO  is 
already  simplified.  In  particular,  E0=c(tl,..,tm)  where  c  is  an  m-ary  constructor  symbol, 
m>=0,  and  tl,..,tm  are  terms.  There  is  a  clause  in  PC  reduce(c(Xl,..,Xm),c(Xl,..,Xm)) 
where  each  Xi  is  a  variable.  Clearly  reduce(E0,E0)  succeeds. 

Let  n>0  and  E0=f(tl,..,tm),  f  not  a  constructor  symbol,  each  ti  a  term  and  m>=0.  Assume 
theorem  holds  for  all  successful  reductions  of  length  less  than  n. 

Since  EO  is  not  simplified,  the  N-reduction  is  of  the  form  EO,..,Ek-l,Ek,..,En,  0<k=<n, 
such  that  Ek-l=>Ek,  but  for  no  i,  0=<i<k-l,  Ei=>Ei+l.  Hence,  Ek-l=f(sl,..,sm)  for  some 
terms  sl,..,sm.  Since  Ek-l=>Ek,  there  is  some  rule  f(Ll,..,Lm)=>RHS  such  that  Ek-1 
matches  f(Ll,..,Lm)  with  some  substitution  a  and  Ek=RHSa.  Since  Ll,..,Lm  do  not  share 
any  variables,  o  is  the  union  of  ol,..,am  such  that  for  each  Li  in  Ll,..,Lm,  si  matches  Li 
with  substitution  oi. 

For  each  i,  if  Li  is  not  a  variable,  then  since  si  matches  Li,  si  is  in  simplified  form.  For 
such  i,  there  is,  by  Lemma  1,  a  successful  N-reduction  ti,..,si  of  length  less  than  or  equal 
to  k- 1 . 


The  rule  f(Ll,..,Lm)=>RHS  is  compiled  into  the  Horn  clause 


reduce(f(X  1 ,. .,Xm),Z):-  Qu { reduce(RHS,Z) } 
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in  accordance  with  the  compilation  rules  stated  above.  This  clause  is  contained  in  PC. 

Consider  the  query  reduce(EO,En),  i.e.  reduce(f(tl,..,tm),En).  It  unifies  with 
reduce(f(Xl,..,Xm)JEn)  with  m.g.u.  x={<Xl,tl>,..,<Xm,tm>,<Z,En>)  and  its  immediate 
descendant  is  (Qu { reduce(RHS ,Z) }  )x.  Since  RHS  does  not  contain  any  of  the  Xi,  this  is 
Qxu  { reduce(RHS,En) } . 

Let  Ql,..,Qm  be  the  members  of  Q.  Consider  some  Qi.  If  Qi  is  Xi=Li,  then  Qix=(ti=Li) 
which  succeeds  with  answer  substitution  {<Li,ti>}.  Of  course,  ti  matches  Li,  so 
{<Li,ti>}=oi. 

Otherwise,  Qi=reduce(Xi,Li),  so  Qix=reduce(ti,Li).  Since  there  is  a  successful  N- 
reduction  ti,..,si  of  length  less  than  or  equal  to  k-1,  by  induction  hypothesis,  reduce(ti,si) 
succeeds.  Since  Liai=si,  by  Lemma  3,  reduce(ti,Li)  also  succeeds  with  answer 
substitution  cri. 

By  repeating  the  same  argument  for  each  Qi,  we  see  that  an  SLD-derivation  starting  at 
reduce(EO,En)  contains  reduce(RHSol,..,<Tm,En)  as  a  member.  Since  a  is  the  union  of 
ol,..,cm  and  no  variable  is  defined  in  more  than  one  oi  in  al,..,om, 
RHSal,..,am=RHSa.  But  RHSa=Ek.  Hence  the  SLD-derivation  starting  at 
reduce(EO,En)  contains  reduce(Ek,En).  Since  the  length  of  the  successful  reduction 
Ek,..,En  is  less  than  n,  by  induction  hypothesis,  reduce(Ek,En)  succeeds.  Thus,  the  query 
reduce(EO,En)  succeeds.  QED. 

Lemma  5.  Let  P  be  an  F*  program.  Let  PC  be  the  compiled  version  of  P.  Let  EO  and  En 
be  terms  such  that  reduce(EO,En)  succeeds  in  the  presence  of  PC.  Then  there  is  a 
successful  N-reduction  E0,..,En. 

Plan  of  Proof:  By  induction  on  length  of  successful  SLD-derivation  reduce(EO,En),..,Q 
We  show  that  there  is  some  goal  reduce(Ej,En),  j>0,  in  this  derivation  such  that  there  is 
an  N-reduction  E0,..,Ej.  Since  reduce(Ej,En)  succeeds,  by  induction  hypothesis,  there  is  a 
successful  N-reduction  Ej,..,En.  So  there  is  a  successful  N-reduction  E0,..,Ej,..,En. 

Proof:  By  induction  on  length  n  of  successful  SLD-derivation  starting  at  reduce(EO,En). 
If  n=l  then  there  is  a  clause  reduce(c(Xl,..,Xm),c(Xl,..,Xm))  in  PC  such  that 
reduce(EO,En)  unifies  with  the  head  of  this  clause.  Clearly,  then,  E0=En,  En  is  simplified 
and  the  required  N-reduction  is  simply  EO. 

Let  n>0.  Assume  lemma  for  all  successful  derivations  of  length  less  than  n.  Assume 
EO=f(tl,..,tm)  for  some  non-constructor  function  symbol  f  and  terms  tl,..,tm.  Since 
reduce(EO,En)  succeeds  there  is  a  clause  in  PC: 
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reduce(f(X  1  ,..,Xm),Z):-Qu{reduce(RHS,Z) } 

such  that  it  is  the  compilation  of  a  rule  f(Ll,..,Lm)=>RHS  in  P.  Moreover, 
reduce(f(tl,..,tm),En)  unifies  with  the  head  of  the  above  clause  with  m.g.u. 
x=  { <X  1  ,t  1  >,. .,<Xm,tm>,<Z,En> }  and  Qx  U  {reduce(RHS,Z)}x  has  a  successful 
derivation  of  length  n-1.  Also,  RHSx=RHS  and  Zx=En. 

If  Q  is  empty,  m=0.  So,  by  restriction  (e)  RHS  is  ground.  By  induction  hypothesis  there  is 
a  successful  N-reduction  RHS,..,En.  EO  matches  f(Ll,..,Lm)  and  so  EO=>RHS.  Hence 
E0,RHS,..,En  is  a  successful  N-reduction. 

Suppose  Q  is  non-empty.  Let  Ql,..,Qm  be  the  members  of  Q.  Consider  Qi.  If  Qi=(Xi=Li) 
then  ti  unifies  with  Li  with  substitution  oi=  { <Li,ti> } .  Construct  the  singleton  sequence 
f(tl,..,ti,..,tm).  This  sequence  is  an  N-reduction. 

If  Qi=reduce(Xi,Li)  then  Li=c(Ul,..,Uk)  for  some  constructor  symbol  c  and  variables 
Ul,..,Uk.  Also  Qix=reduce(ti,Li).  Clearly,  reduce(ti,Li)  succeeds.  Let  the  answer 
substitution  be  oi.  By  Lemma  2,  Licri  is  ground.  Then  reduce(ti,Liai)  also  succeeds.  The 
successful  derivation  of  reduce(ti,Liai)  is  the  same  as  that  of  reduce(ti,Li)  with  Li 
replaced  by  Lioi.  So,  the  length  of  this  derivation  is  also  less  than  n.  By  induction 
hypothesis,  there  is  a  successful  N-reduction  ti,..,Lioi.  By  Lemma  4  of  Section  II,  the 
sequence  f(tl,..,ti,..,tm),..,f(tl,..,Liai,..,tm)  is  an  N-reduction. 

Hence  we  obtain  the  N-reductions  f(tl,..,tm),..,f(Llal,..,tm)  and 
f(Llal,t2,..,tm),..,f(Llal,L2a2,..,sm)  and .. 

f(Llol  JL2o2,..,tm),..,f(Llol,L2a2,..,Lmam).  The  concatenation  of  these  reductions  is 
itself  an  N-reduction.  Since  Ll,..,Lm  do  not  share  variables,  f(Llol,..,Lmom)  matches 
f(Ll,..JLm)  with  a  substitution  which  is  the  union  of  cl,.., am.  Let  a  be  this  union.  Hence 
f(Llol,..,Lmam)=>RHSa.  Since  all  the  variables  of  RHS  are  in  Ll,..,Lm  and  for  each 
oi,  Lioi  is  ground,  RHSa  is  ground. 

The  predication  reduce(RHSc,En)  succeeds  and  the  length  of  the  associated  successful 
derivation  is  less  than  n.  By  induction  hypothesis,  there  is  a  successful  N-reduction 
RHSa,..JBn.  Hence  there  is  a  successful  N-reduction 
f(tl,..,tn),..,f(Llal,..,Lmom),RHSa,..,En.  QED. 

Theorem  1.  The  correctness  of  the  compilation  of  F*.  Let  P  be  an  F*  program  and  PC 
be  its  compilation.  Let  EO  and  En  be  ground  terms.  Then  there  is  a  successful  N-reduction 
beginning  with  EO  and  ending  with  En  iff  PCI-reduce(EO,En). 

Proof:  Lemmas  4  and  5  state,  respectively,  the  if  and  only  if  parts  of  the  theorem.  QED. 


VI.  PROGRAMMING  IN  LOG(F) 


VI.  1  INTRODUCTION 

This  section  describes  seven  examples  of  programming  in  LOG(F).  The  first  illustrates 
non-determinism  of  LOG(F),  and  usefulness  of  lazy  evaluation  even  when  manipulating 
finite  data  structures.  The  second  shows  how  useful  cases  of  the  rule  of  substitution  of 
equals  for  equals  can  be  implemented.  The  third  obtains  a  new  proof  of  confluence  of 
combinatory  logic.  The  fourth  shows  how  a  pair  of  communicating  processes  can  be 
simulated.  The  fifth  illustrates  the  power  of  NA-derivations,  and  manipulation  of  infinite 
numerical  structures.  The  sixth  illustrates  manipulation  of  infinite  graphical  structures. 
The  seventh  compares  LOG(F)  with  the  system  of  Tamaki  [1984],  In  each  case,  clauses 
listed  are  those  obtained  after  performing  optimizations  discussed  in  previous  sections. 
Finally,  Section  VI.9  compares  performance  of  LOG(F)  with  that  of  Prolog. 

VI.2  NON-DETERMINISM  IN  F* 

As  discussed  in  Section  I,  permutations  of  lists  can  be  computed  by  the  following  F* 
program: 

perm([])=>[], 

perm([UIV])=>insert(U,perm(V)). 

insert  (U,X)=>[UIX] . 

insert  (U,[AIB])=>[Alinsert(U,B)]. 

This  is  compiled,  and  optimized  into: 

reduce([],[]). 

reduce([AIB],[AIB]). 

reduce(insert(A,B),[AIB]). 

reduce(insert(A,B),[Clinsert(A,D)]):-reduce(B,[CID]). 

reduce(perm(A),B):-reduce(A,C),perm(C)=>D,reduce(D,B). 

perm([])=>[]. 

perm([AIB])=>insert(A,perm(B)). 

Note  that  some  =>  rules  survive  in  the  compiled  version.  This  is  due  to  the  method, 
discussed  in  Section  V.4,  of  compiling  F*  programs  satisfying  restriction  (g).  If  we  now 
type  reduce(perm([l,2,3]),Z),  we  obtain  Z=[llperm([2,3])],  Z=[2linsert(l,perm([3]))], 
Z=[3linsert(l,insert(2,perm([])))].  However,  if  we  define: 
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make_list(X,[]):-reduce(X,[]). 

make_list(X  ,[UI  V] ):  -reduce(X,  [U  IB  ]),make_list(B ,  V). 

and  then  type  make_list(perm([l,2,3]),Z),  we  obtain  Z=[1,2,3],..,Z=[3,2,1]. 

The  above  program  can  be  used  to  implement  a  very  efficient  solution  to  the  N-queens 
problem  which  is  to  place  N  queens  on  an  NxN  chess  board  so  that  no  two  queens  attack 
each  other.  It  is  easily  seen  that  each  queen  must  be  in  a  distinct  row  and  column,  so  that 
candidates  for  solutions  can  be  represented  by  permutations  of  the  list  [1,2,..,N].  The 
position  of  the  ith  queen  in  a  permutation  p  is  [i,q]  where  q  is  is  the  ith  element  of  q.  The 
problem  now  reduces  to  generating  all  permutations  of  [1,2,..,N]  and  testing  whether  they 
are  safe,  or  represent  a  solution. 

Lazy  evaluation  guarantees  that  permutations  are  tested  as  soon  as  they  are  generated.  If 
it  is  determined  that  [Al,..,Am],  m=<N  is  unsafe  then  no  permutation  with  [Al,..,Am]  as 
initial  segment  is  generated.  This  yields  a  drastic  pruning  of  the  search  space.  The 
program  is: 

if(true,X,Y)=>X. 

if(false,X,Y)=>Y. 

queens(X)=>safe(perm(X)). 

safe([])=>[]. 

safe([UI  V])=>[Ulsafe(nodiagonal(U,  V,  1 ))]. 
nodiagonal(U ,[]  ,N)=>[  ] . 

nodiagonal(U,[AIB],N)=>if(noattack(U,A,N),[Alnodiagonal(U,B,N+l)],none). 

noattack(U,A,N)=>neg(equal(abs(U-A),N)). 

This  is  compiled  into: 

reduce([],[]). 
reduce([UIV],[UIV]). 
reduce(true,true). 
reduce(false, false). 

reduce(queens(A),B):-queens(A)=>C,reduce(C,B). 

reduce(safe(A),B):-reduce(A,C),safe(C)=>D,reduce(D,B). 

reduce(if(A,B,C),D):-reduce(A,E),if(E,B,C)=>F,reduce(F,D). 

reduce(noattack(A,B,C),D):-noattack(A,B,C)=>E,reduce(E,D). 

reduce(nodiagonal(A,B,C),D):-reduce(B,E),nodiagonal(A,E,C)=>F,reduce(F,D). 


queens(A)=>safe(perm(A)). 
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safe(  [])=>[]. 

safe([A!B])=>[Alsafe(nodiagonal(A,B,  1 ))] . 

if(true,A,B)=>A. 

if(false,A,B)=>B. 

nodiagonal(A,[],B)=>[], 

nodiagonal(A,[BIC]  ,D)=> 

if(noattack(A,B,D),[Blnodiagonal(A,C,E)],none):-E  is  D+l. 
noattack(A,B,C)=>D:-E  is  A-B,abs(E,F),equal(F,C,G),neg(G,D). 

The  eager  functions  are  defined  in  Prolog: 

abs(X,X):-X>=0. 
abs(X,Y):-X<0,Y  is  -X. 
neg(true, false). 
neg(false,true). 
equal(A,A,true). 
equal(A,B,false):-not  A=B. 
less_than(U,A,true):-U<A. 
less_than(U,A,false):-U>=A. 

If  we  now  type  make_list(queens([l,2,3,4]),Z),  we  obtain  Z=[2,4,l,3]  and  Z=[3, 1,4,2]. 

VI.3  IMPLEMENTING  SUBSTITUTION  OF  EQUALS  FOR  EQUALS 

If  a  DF*  program  is  interpreted  as  an  equality  theory,  reduce  clauses  can  be  thought  of  as 
implementing  an  equality  theory  in  Prolog  with  the  restriction  that  it  be  used  only  for 
simplification  of  terms.  Now,  given  a  clause  of  the  form  p(c(Xl,..,Xm)):-Body,  where  c 
is  a  constructor  symbol,  we  can  add  another  clause  stating  a  rule  of  substitution  of  equals: 

p(X):-reduce(X,c(X  1  ,..,Xm)),p(c(X  1 ...  ,Xm)). 

Now,  even  when  a  term  E  is  not  of  the  form  c(Xl,..,Xm),  p  can  still  be  inferred  for  E, 
provided  E  is  reducible  to  a  term  of  the  form  c(Xl,..,Xm).  For  example,  from: 

married(X) :  -  spouse(X,  Y) . 
spouse(scott.a). 

one  can  infer  married(scott).  One  can  now  add  the  clause: 
married(X):-reduce(X,Y),married(Y). 
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An  equality  theory  is: 

author(waverley)=>author(ivanhoe). 

author(ivanhoe)=>scott. 

The  reduce  clauses  for  the  last  two  =>  rules  arc: 

reduce(scott, scott) . 

reduce(ivanhoe,ivanhoe). 

reduce(waverley,waverley). 

reduce(author(X)  X)  :  -reduce(X ,  waverley  ),reduce(author(ivanhoe)  ,Z) . 
reduce(author(X),Z):-reduce(X,ivanhoe),reduce(scott,Z). 

Here  scott,  waverley,  and  ivanhoe  arc  constructor  symbols.  Now  one  can  infer,  in  Prolog, 
married(author( waverley)),  i.e.  the  result  of  substituting  author( waverley)  for  scott  in 
married(scott). 

VI.4  COMBINATORY  LOGIC 

A  new  proof  is  obtained  of  the  theorem  that  the  SKI  calculus  is  confluent.  Following  the 
ideas  of  Ait-Kaci  &  Nasr  [1986],  SKI  reduction  rules  can  be  expressed  as  a  DF* 
program: 

apply(k,X)=>kl(. ' 
apply  (k  1  (X) ,  Y)=>X. 
apply(s  JF)=>s  1  (F). 
apply(sl  (F),G)=>s2(F,G). 

apply(s2(F,G),X)=>apply(apply(F,X),apply(G,X)). 

Here  k,sjkl,sl,s2  are  constructor  symbols,  and  apply  a  non-constructor  symbol.  From 
confluence  of  DF*,  it  follows  that  the  SKI  calculus  is  also  confluent.  These  rules  are 
translated  into  the  following  reduce  clauses: 

reduce(s,s). 
reduce(k,k). 
reduce(k  1  (X)Jc  1  (X)). 
reduce(s  1  (X),s  1  (X)). 
reduce(s2(X,Y),s2(X,Y)). 


reduce(apply(A,B),Z):-reduce(A,k),reduce(kl(B),Z). 
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reduce(apply(A,B),Z):-reduce(A,kl(D)),reduce(D,Z). 

reduce(apply(A,B),Z):-reduce(A,s),reduce(sl(B),Z). 

reduce(apply(A,B),Z):-reduce(A,sl(C)),reduce(s2(C,B),Z). 

reduce(apply(A,B),Z):- 

reduce(A,s2(D,E)),reduce(apply(apply(D,B),apply(E,B)),Z). 

These  clauses  can  be  used  to  contemplate  higher-order  programming  in  LOG(F). 

VI.5  TWO  WAY  COMMUNICATION 

This  example  models  communcation  between  two  users,  each  of  whom  types  a  stream  of 
tokens  on  his  screen.  Each  token  is  of  the  form  [A]  or  [send.M]  in  which  case  M  appears 
on  both  screens.  The  communication  is  modeled  by: 

extract_messages([[  A]IX]  )=>extract_messages(X). 
extract_messages([[send,M]IX])=>[Mlextract_messages(X)]. 

screen  1  =>fair_merge(key  1  ,extract_messages(key2)). 
screen2=>fair_merge(key2,extract_messages(key  1 )). 

Here  send,  []  and  I  are  constructor  symbols.  We  assume  there  exists  a  function  fair_merge 
which  takes  as  input  two  streams  and  interleaves  their  tokens  into  an  output  stream.  If  two 
tokens  appear  in  some  order  in  an  input,  then  they  appear  in  the  same  order  in  the  output. 
Finally,  fair_merge  consumes  each  input  at  the  rate  at  which  it  is  produced. 

Note  that  the  second  extract_messages  rule  has  a  left  hand  side  of  depth  greater  than  two, 
so,  strictly  speaking,  it  is  not  an  F*  rule.  However,  it  can  be  expressed  in  F*  as  follows: 

extract_messages([AIX])=>g(A,X). 

g([UIV],X)=>h(U,V,X). 

h(send,V,X)=>[Vlextract_messages(X)]. 

Here  g  and  h  are  auxiliary  function  symbols.  Assuming  that  keyl  and  key2  axe  streams 
of  tokens  typed  by,  respectively,  the  first  and  second  user,  the  term  screen  1  will  reduce  to 
the  stream  of  tokens  appearing  on  the  first  user’s  screen.  Similarly  for  screen2.  The 
reduce  clauses  are: 

reduce([],[]). 

reduce([UIV],[UIV]). 

reduce(send,send). 
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reduce(extract_messages(A),B):- 

reduce(A,[CID]),reduce(C,[E]),reduce(extract_messages(D),B). 

reduce(extract_messages(A),[Blextract_messages(C)]):- 

rcduce(A,[DIC]),  reduce(D,[EIF]),  reduce(E,send),  reduce(F,[B]). 
reduce(screen  1 ,  A):-reduce(fair_merge(key  1  ,extract_messages(key2)).  A). 
reduce(screen2,A):-reduce(fair_merge(key2,extract_messages(keyl)),A). 

VI.6  HAMMING’S  PROBLEM 

The  problem,  described  in  [Dijkstra  1976],  is  to  generate,  in  increasing  order,  all  those 
numbers  which  are  divisible  by  no  primes  other  than  2,3  or  5.  Dijkstra  states  that  an 
equivalent  problem  is  to  generate  the  sequence  of  numbers,  in  ascending  order,  defined 
by  the  following  axioms: 

(a)  1  is  in  the  sequence 

(b)  If  x  is  in  the  sequence,  then  so  are  2*x,  3*x  and  5*x. 

(c)  The  sequence  contains  no  values  except  those  on  account  of  (a)  and  (b). 

These  axioms  can  be  expressed  by  the  following  DF*  program: 

hamming=>hamming_aux([  1  lhamming] ). 
hamming_aux(X)=> 

merge(times_list(2,X),merge(times_list(3,X),times_list(5,X))). 

merge([UIV],[AIB])=>if(U<A,[Ulmerge(V,[AIB])],merge_aux(U,V,A,B)). 

merge_aux(U,V,A,B)=>if(equal(U,A),[Ulmerge(V,B)],[Almerge([UIV],B)]). 

times_list(N,[])=>[]. 

times  Jlist(N,[UIV])=>[U*Nltimes_list(N,V)]. 

Function  times_list  multiplies  each  element  of  its  input  list  by  a  fixed  number.  Function 
merge  takes  two  lists  in  ascending  order  and  merges  their  elements  in  increasing  order. 
Functions  hamming  and  hamming_aux  are  implementations  of  axioms  (a),(b),(c). 

This  program  illustrates  the  power  ofNA-derivations.  The  definition  of  hamming_aux 
contains  three  occurrences  of  X  on  the  right  hand  side.  If  care  is  taken  that  whenever  the 
term  at  one  occurrence  of  X  is  reduced,  terms  at  the  other  two  occurrences  of  X  are  also 
reduced,  the  list  hamming  is  produced  with  little  overhead.  If  not,  then  the  overhead 
increases  exponentially.  This  can  be  felt  by  comparing  the  speed  with  which  elements  of 
hamming  are  printed  on  the  screen  in  the  two  cases.  The  labeled  version  of  this  program 
is  compiled  into  the  following  reduce  clauses: 
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reduce(hamming(A),B):-not  var(A),B=A,!. 
reduce(hamming_aux(A,B),C):-not  var(A),C=A,!. 
reduce(merge(A,B,C),D):-not  var(A)>D=A>!. 
reduce(merge_aux(A,B,C,D,E),F):-not  var(A),F=A,!. 
reduce(times_list(A,B,C),D):-not  var(A)JD=A,!. 
reduce(if(A,B,C,D),E):-not  var(A),E=A,!. 

reduce([],[]). 
reduce([AIB],[AIB]). 
reduce(true,true). 
rcduce(false, false). 

reduce(hamming(A),A):-hamming(B)=>C,reduce(C,A). 

reduce(hamming_aux(A,B),A):-hamming_aux(C,B)=>D,reduce(D,A). 

reduce(merge(A,B,C),A):- 

reduce(B,D),reduce(C,E),merge(F,D,E)=>G,reduce(G,A). 
reduce(merge_aux(  A,B  ,C,D,E),  A): -merge_aux(F,B  ,C,D,E)=>G,reduce  (G,A). 
reduce(times_list(A,B,C),A):-reduce(C,D),times_Iist(E,B,D)=>F>reduce(F,A). 
reduce(if(A,B,C,D),A):-reduce(B,E),if(F,E,C,D)=>G,reduce(G,A). 

hamming(A)=>[  1  lhamming_aux(B,hamming(C))]. 
hamming_aux(A,B)=> 

merge(C,times_list(D,2,B), 

merge(E,times_list(F,3,B),times_list(G,5,B))). 

merge(A,[BIC],[DIE])=> 

if(F,G,[Blmerge(H,C,[DIE])],merge_aux(I,B,C,D,E)):-less_than(B,D,G). 

merge_aux(A,B,C,D,E)=> 

if (F,G,  [B  Imerge  (H,C,E)]  ,[D  lmerge(I,  [B  1C]  ,E)] ) :  -equal(B  ,D,G). 
times_list(  A,B  ,[])=>[]• 

times_list(A,B,[CID])=>[Eltimes_list(F,B,D)]:-E  is  C*B. 

if(A,true,B,C)=>B. 

if(A,false,B,C)=>C. 

Definitions  of  the  eager  functions  less_than  and  equal  are  as  in  Section  VI.2.  If  we  now 
type  print_list(hamming(_)),  we  obtain  1,2,3,4,5,6,8,9,10,12,... 

VI.7  INFINITE  GRAPHICAL  STRUCTURES. 

Henderson  [1982]  has  shown  how  to  use  functional  programming  for  defining  and 
manipulating  graphical  structures.  In  particular,  he  shows  how  to  construct  Square  Limit, 
an  Escher  woodcut.  We  use  Henderson’s  building  blocks  to  tile  the  x-y  plane  in  an 
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interesting  way.  A  picture  is  represented  by  a  list  of  vectors,  each  of  the  form  v(A,B)~ 
v(X, Y),  where  A,B,X,Y  are  real  numbers.  Transformations  on  pictures,  such  as 
composition,  translation,  scaling,  or  rotation  (about  the  origin)  are  defined  as  follows: 

union([],X)=>X. 

union([FXIRX]  ,Y)=>[FXIunion(  Y,RX)] . 
rotate([], _)=>[]. 

rotate([v(X,Y)--v(A,B)IL],Theta)=> 

[v(X*cos(Theta)-Y*sin(Theta),X*sin(Theta)+Y*cos(Theta))-- 
v(A*cos(Theta)-B*sin(Theta),A*sin(Theta)+B*cos(Theta))lrotate(L, Theta)]. 

translate([]  ,_,_)=>[]. 
translate([v(X,Y)-v(A,B)IL],DxJ)y)=> 

[v(X+Dx,Y+Dy)-v(A+Dx,B+Dy)ltranslate(LJDx,Dy)]. 

scaled],  _,_)=>[]. 

scale([v(X,Y)~v(A,B)IL]  ,Kx,Ky)=> 

[v(X*Kx,Y*Ky)--v(A*Kx,B*Ky)Iscale(L,Kx,Ky)]. 

The  basic  pictures  are  p,q,r,s,  drawn  in  a  36x36  grid,  and  shown  in  order  in  the  top  row  in 
Figure  1.  (The  vectors  can  be  found,  not  unfortunately,  in  Henderson’s  paper,  but  in 
[Robinson  &  Green  1987]).  These  are  combined  by  quartet  into  t,  shown  in  the  second 
row.  The  third  and  fourth  rows  show,  respectively,  block  l(t)  and  block2(t),  the  two  basic 
144x72  rectangles.  row(Block,0)  repeats  Block,  infinitely  often,  at  intervals  of  144  units, 
in  the  x  and  -x  directions.  alt_rows(Row,0)  repeats  a  row  infinitely  often,  at  intervals  of 
144  units,  in  the  y  and  -y  directions.  mosaic(Blockl  ,Block2)  computes  rows  of  Block  1 
and  Block2,  alternates  these,  and  then  composes  these  to  tile  the  x-y  plane.  Figure  2.  The 
program  is: 

rot_pos(X)=>translate(rotate(translate(X,-72,-72),- 1 .57), 72,0). 
rot_neg(X)=>translate(rotate(translate(X,-72,-72),1.57),0,72). 

blockl(X)=>union(X,translate(rot_neg(X),72,0)). 

block2(X)=> 

union(rot_pos(X), 

translate(rotate(translate(rot_pos(X),-72,0),- 1 .57), 72,0)). 

row(Block,N)=>union(translate(Block,144*N,0), 

union(translate(Block,- 1 44*N,0),row(Block,N+ 1 ))). 


alt_rows(Row,N)=>union(translate(Row,0, 1 44*N), 
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union(translate(Row,0,-144*N),alt_rows(Row,N+l))). 

mosaic(Block  1  ,Block2)=>union(alt_rows(row(Block  1 ,0),0), 

translate(alt_rows(row(Block2,0),0),0,72)). 
beside(A,B)=>union(A, translated, 36,0)). 
above(A,B)=>union(A,translate(B,0,36)). 
quartet(Pl*P2,P3,P4)=>above(beside(P3,P4),beside(Pl>P2)). 

t=>quartet(p,q,r,s). 

P=>[..]. 

q=>[.]. 

r=>[..]. 

s=>[..]. 

Note  that  mosaic  computes  an  infinite  row,  an  infinite  number  of  times.  However, 
reduction-completeness  ofDF*  precludes  infinite  runaway.  Vectors  are  displayed  as  they 
are  generated.  The  above  program  is  compiled  into: 

reduce(rotate(A,B),[]) reduce(A,[]). 
reduce(rotate(A,B),[v(CJ))-v(E,F)lrotate(G,B)]) 

reduce(A,[HIG]),  reduce(H,I-J),  reduce(I,v(K,L)), 
reduce(J,v(M,N)),  cos(B,0),  P  is  K*0,  sin(B,Q), 

R  is  L*Q,  C  is  P-R,  sin(B,S),  T  is  K*S, 
cos(B.U),  V  is  L*U,  D  is  T+V,  cos(B,W), 

X  is  M*W,  sin(B,Y),  Z  is  N*Y,  E  is  X-Z, 
sin(B,Al),  B1  is  M*A1,  cos(B,Cl),  D1  is  N*C1,  F is  Bl+Dl. 
reduce(translate(A,B,C),[]):-reduce(A,[]). 
reduce(translate(A,B,C),[v(D,E)--v(F>G)ltranslate(H,B,C)]) 
reduce(A,[IIH]),  reduce(IJ--K),  reduce(J,v(L,M)), 
reduce(K,v(N,0)),  D  is  L+B,  E  is  M+C, 

F  is  N+B,  G  is  O+C. 
reduce(scale(A,B,C),[]) reduce(A.n)- 
reduce(scale(A,B,C),[v(D,E)--v(F,G)lscale(H,B,C)]) 

reduce(A,[IIH]),  reduceflJ-K),  reduce(J,v(L,M)), 
reduce(K,v(N,0)),  D  is  L*B,  E  is  M*C,  F  is  N*B,  G  is  0*C. 

reduce(true,true). 
reduce(false, false). 
reduce([],[]). 
reduce([AIB],[AIB]). 
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reduce(A-B,A--B). 

reduce(v(A,B),v(A,B)). 

reduce(p.A) p=>B,  reduce(B,A). 
reduce(q,A) q=>B,  reduce(B,A). 
reduce(r,A) r=>B,  reduce(B.A). 
reduce(s,A) s=>B,  reduce(B,A). 
reduce(t,A) t=>B,  reduce(B,A). 
reduce(blockl(A),B) blockI(A)=>C,  reduce(C,B). 
reduce(block2(A),B) block2(A)=>C,  reduce(C,B). 
reduce(cycle(A),B) cycle(A)=>C,  reduce(C,B). 
reduce(rot(A),B) rot(A)=>C,  reduce(C,B). 
reduce(rot_neg(A),B) rot_neg(A)=>C,  reduce(C,B). 
reduce(rot_pos(A),B) rot_pos(A)=>C,  reduce(C.B). 
reduce(abovel(A,B),C)  >  above  1(A,B)=>D,  reduce(D,C). 
reduce(alt_rows(A,B),C) alt_rows(A,B)=>D,  reduce(D,C). 
reduce(besidel(A,B),C) beside  1( A, B)=>D,  reduce(D,C). 
reduce(mosaic(A,B)>C) mosaic(A,B)=>D,  reduce(D,C). 
reduce(row(A,B),C) row(A,B)=>D,  reduce(D,C). 
reduce(union(A,B),C) reduce(AJD),  union(D,B)=>E,  rcduce(E,C). 
reduce(quartet(A,B,C,D),E) quartet(A,B ,C,D)=>F,  reduce(F,E). 

union([],A)=>A. 

union([  AIB]  ,C)=>[  Alunion(C,B)3 . 

rot_pos(A)=>translate(rotate(translate(A,-72,-72),-1.57),72,0). 
rot_neg(A)=>translate(rotate(translate(A,-72,-72),  1 .57), 0,72). 
blockl(A)=>union(A,translate(rot_neg(A),72,0)). 
block2(A)=>union(rot_pos(A), 

translate(rotate(translate(rot_pos(A),-72,0),- 1 .57),72,0)). 
row(A,B)=>union(translate(A,C,0), 

union(translate(A,D,0),row(A,E))) 

C  is  144*B,  D  is-144*B,  E  is  B+l. 
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alt_rows(A,B)=>union(translate(A,0,C),union(translate(A,0,D),alt_rows(A,E))):- 
C  is  144*B,  D  is-144*B,  E  is  B+l. 
mosaic(A,B)=>union(alt_rows(row(A,0),0), 

translate(alt_rows(row(B,0),0),0,72)). 
beside(A,B)=>union(A,translate(B,36,0)). 
above(A,B)=>union(A, translated, 0,36)). 
quartet(A,B,CJ))=>above(beside(C,D),beside(A,B))- 
rot(A)=>rotate(A,l  .57). 

cycle(A)=>union(A,union(rot(A),union(rot(rot(A)),rot(rot(rot(A)))))). 

t=>quartet(p,q,r,s). 

P=>[.]. 

q=>[-]- 

r=>[..]. 

s=>[..]. 

VI.8  FIRST  TWO  ELEMENTS  OF  A  LIST 

Tamaki  [1984]  shows  how  to  compile  restricted  equality  theories,  expressed  by  a 
reducibility  predicate  =>,  into  Horn  clauses  with  a  small  search  space.  For  example,  the 
clauses: 

int(N)=>[Nlint(N+ 1 )] . 
first2([X,YIZ],[X,Y]). 

are  compiled  into: 


X=>X 

int(N)=>[NIZ]  :-int(N+ 1  )=>Z 
first2([X,YIZ],[X,Y]) 

Now  the  query  int(l)=>X,first2(X,Y)  succeeds  with  answer  substitution  X=[l,2lint(3)], 
Y=[l,2].  However,  it  succeeds  with  infinitely  many  other  distinct  answer  substitutions, 
e.g.  X=[l,2,3lint(4)],Y=[l,2],  X=[l,2,3,4lint(5)],Y=[l,2]....  In  LOG(F),  however,  we 
would  first  define: 

int(N)=>[N!int(N+ 1 )] 

first2([UIV])=>g(U,V) 

g(U,[AIB])=>[U,A] 


These  would  be  compiled  to: 
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reduce([],[]). 
reduce([UI  V]  ,[UI  V] ). 

reduce(int(N),Z):-Nl  is  N+l,reduce([Nlint(Nl)],Z). 
reduce(first2(X),Z):-reduce(X,[UIV]),  reduce(g(U,V),Z). 
reduce(g(C,X)  ,Z) :  -reduce(X ,  [  AIB  ] )  jeduce([C,A  ]  ,Z). 

Now  the  query  reduce(first2(int(l)),Z),  would  succeed  with  exactly  one  answer 
substitution,  Z=[l,2].  This  is  as  desired. 

VI.9  COMPARING  LOG(F)  PERFORMANCE  WITH  THAT  OF  PROLOG 

Programs  of  similar  length,  and  intellectual  complexity  are  written  in  both  F*  and  in 
Prolog.  The  former  were  compiled  into  Prolog,  and  optimized,  before  being  compared 
with  the  latter.  The  performance  figures  in  Quintus  Prolog  on  a  SUN  3/50  are  listed  in 
the  table  below. 


For  problems  in  which  data  structures  are  always  completely  evaluated,  lazy  evaluation 
cannot  reduce  lengths  of  computation.  Such  problems  include  list  reversal,  or  sorting. 
For  these,  LOG(F)  is,  on  an  average,  five  times  slower  than  Prolog.  However,  the 
slowdown  for  a  given  problem  appears  to  stay  the  same,  regardless  of  the  size  of  the 
input. 


For  problems  in  which  data  structures  need  only  be  partially  evaluated,  e.g.  the  N-queens 
problem,  or  tiling  an  infinite  plane,  lazy  evaluation  can  reduce  lengths  of  computation. 
For  these,  LOG(F)  can  be  faster  than  Prolog  by  factors  which  are  unbounded,  i.e.  grow 
with  input  size,  and  by  factors  which  are  infinite. 


Time  in  milliseconds 

Proloe 

LOG  OF) 

Pto1ok/LOG(F) 

Reverse:  3200  elements 

83 

883 

0.09 

Reverse:  6400  elements 

150 

1755 

0.08 

Ouicksort:  60  elements 

83 

539 

0.15 

Ouicksort:  120  elements 

250 

1261 

0.19 

Sieve:  First  50  primes 

422 

1261 

0.33 

Sieve:  First  100  primes 

1816 

4511 

0.40 

All  permutations  of  [1 .2,3.4 .5] 

427 

516 

0.82 

All  permutations  of  fl  .2,3,4.5.61 

3000 

3172 

0.94 

8-Oueens:  All  solutions 

62783 

17511 

3.58 

9-Queens:  All  solutions 

635144 

86539 

7.33 

15-Queens:  First  solution 

>30  minutes 

30300 

>60 

Infinite  plane:  First  vector 

oo 

-0 

OO 

VI- 13 


Programming  in 


] 


Figure  1.  Some  Graphical  Primitives 
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Figure  2.  Square  Unlimit 


Vn.  SUMMARY  AND  CONCLUSIONS 


A  new  approach  for  combining  logic  programming,  rewriting,  and  lazy  evaluation  is 
described.  It  rests  upon  subsuming  within  logic  programming,  instead  of  upon  extending 
it  with,  rewriting,  and  lazy  evaluation. 

F*  is  a  non-terminating,  non-deterministic  rewrite  rule  system.  The  reduction  strategy  for 
it,  select,  is  reduction-complete.  DF*  is  a  subset  of  F*,  and  is  also  non-terminating.  DF* 
satisfies  confluence,  directedness,  and  minimality.  Reduction-completeness,  and 
minimality  enable  select  to  exhibit,  respectively,  weak  and  strong  forms  of  laziness. 

F*  can  be  compiled  into  Horn  clauses  in  such  a  way  that  when  SLD-resolution  interprets 
them,  it  directly  simulates  the  behavior  of  select.  In  particular,  it  is  made  to  exhibit 
laziness.  LOG(F)  is  defined  to  be  a  logic  programming  system  augmented  with  an  F* 
compiler,  and  the  equality  axiom  X=X.  Since  clauses  obtained  by  compiling  F* 
programs  can  be  called  from  other  logic  programs,  LOG(F)  is  proposed  as  a  combination 
of  logic  programming,  rewriting,  and  lazy  evaluation. 

LOG(F)  offers,  perhaps  for  the  first  time,  an  efficient  implementation  of  lazy  evaluation 
within  a  widely  used  language,  namely,  Prolog.  For  problems  in  which  lazy  evaluation 
cannot  reduce  lengths  of  computation,  LOG(F)  is  somewhat  slower  than  Prolog.  For 
problems  in  which  lazy  evaluation  does  reduce  lengths  of  computation,  LOG(F)  can  be 
faster  than  Prolog  by  factors  which  are  unbounded,  i.e.  grow  with  input  size,  and  factors 
which  are  infinite. 

LOG(F)  can  also  be  used  to  implement  useful  cases  of  the  rule  of  substitution  of  equals 
for  equals.  Confluence  of  DF*  yields  a  new  proof  of  the  confluence  of  combinatory  logic. 
Finally,  DF*  seems  to  be  a  good  candidate  for  implementation  on  parallel  machines.  It 
seems  to  offer  a  reasonable  compromise  between  sequential  execution  and  unbounded 
parallelism.  Due  to  directedness  of  DF*,  arguments  of  f  in  f(tl,..,tm)  can  be  simplified  in 
parallel,  however,  they  would  be  simplified  lazily. 
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